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ABSIBACT 

Eaploiatory data analysis techniques were utilized to 
demonstrate the effectiveness of such techniques in identi- 
fying factors associated with attrition from the United 
States Army. Multivariate graphical data analysis was 
performed utilizing the "Draftsman” program recently added 
to the NES GRAFSTAT package, as well as other exploratory 
techniques. Empirical survivor curves which take into 
account and explicitly display the discrete probabilities of 
departure cf enlistees at 36 or 48 months are provided. 
Tables are provided depicting probabilities of attrition and 
reenlistment for selected personal characteristics of 
enlistees. 
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I. IHTEODOCTION 



1. E2CKCSOOND 

The incepticn cf the All Volunteer force in 1973 
provided Army nanpcwer planners with the challenge of 
attracting, recruiting and retaining high quality 
personnel. The ever-increasing technology on the battle- 
field coupled with budget constraints have forced manpower 
planners to search fcr an efficient alternative to sheer 
numbers. The soldier of today must be able to operate and 
maintain highly sophisticated equipment. In addition, the 
Army manpower planner must also cope with a decreasing 
supply cf 18-21 year olds. In fact, this cohort is 
predicted to shrink by about 15 percent by 1988 when 
compared to the 1979 cohort, and by about 25 per cent by 1994 
[Ref. 1: p. 2 ]. 

Cf course, manpower shortages in the army are nothing 
new. Past shortages have been both quantitative and quali- 
tative; the shortages historically have fluctuated over the 
years depending on the intricate balance among military 
requirements, civilian employment and wage alternatives. 
[Hef. 1: p.1] 

Currently Army recruiters have eliminated shortages. 
Through an extensive advertising campaign, army planners 
have taken maximum advantage of current economic conditions: 
since the inception cf the All Volunteer Force, the army has 
met its objectives in numbers of enlistees in all but two 
years (FY77,FI79) and has met 100 percent of objective in 
the last four years [fief. 2: pp.6-7J. 

The trends alluded to above, however, indicate that such 
ease in manning the force may be short-lived. Army manpower 



planners maj be forced to recruit "less-gualified" soldiers 
just to leet manning requirements. More screening will be 
necessary to meet these reguirements in an adeguate 
fashion. Cne particular screen that has seen widespread use 
is educatiot level [ Bef - 3; p«342]. Future recruiting may 
not result in the high percentage of High School Diploma 
Graduates that is currently enjoyed: from FY79 through FY83 

an average cf sixty percent of all non-prior-service enlis- 
tees have been High-School Diploma Graduates [Eef. 2; 
pp. 6-7]. 

Ihe FY85 Army Budget calls for holding active end 
strength to FY 84 levels £Bef. 4; p.16]. This in turn 

leads to maintaining an 80.6 per cent level of High School 
Diploma Graduate content [Ref. 4: p. 16]. In order to main- 

tain this level and maintain FY 84 end strength, a maximum 
of 12 per cent of total enlistees may be non-high-schcol 
diploma graduates (NESDG). 

In light of increased (due to inflation) or at best 
constant recruiting costs, army manpower planners must 
necessarily be concerned with determining exactly what level 
of education produces the best recruiting risk. In ether 
words, if Non-High- School Diploma Graduates and Graduate 
Eguivalent Degree enlistees are a necessary part of the 
force structure, what, if any, are the associations between 
education level and "performance"? This research effort 
will provide some insight to this guestion. 

Some commonly accepted measures of performance currently 
in use by army manpower planners are 

1. Attrition (various definitions and levels) , 

2. Skill Qualification Tests scores, 

3. Military judicial and non-judicial actions or lack 
thereof. 

[Eef. 5] 
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lie tern: attrition itself has taken on many different 

meanings in recent research. In many studies, attrition has 
teen defined as "failure to complete the first term of 
service." [Ref. 6: P-24] In this study total length of 

service obtained will be used as a measure of performance in 
the initial analysis and the above definition will be used 
in later more detailed analysis. 

Manpower policy makers have been investigating attrition 
since the early 60*s £Eef. 7: P-1]- Such research has 

attempted to predict attrition through various sorts of 
models. Across the Army, Navy, and Air Force, level of 
education, mental ability and age have been determined as 
the best "pre-service" predictor variables of attrition 
£Eef. 7: p-1]- 

The cost of "assessing, dressing and training" a typical 
soldier has been estimated at approximately $15000 £Eef. 8 
p.16]. Ihis initial cost resulted in a total cost of 

$1,743,200,000 in reaching FY 83 enlistment goals, based on 
116,215 accessions £Bef. 9: p. i ]. Obviously one means of 

meeting reguirements at minimum cost is to reduce 

unnecessary losses of money through premature attrition. 

Attrition studies have been, for the most part, based on 
different fcrms of regression models, particularly linear 
and Iccistic models using both individual occupations and 
occupaticnal groups. £Bef. 10: pp. 1-10] 

E. POEPCSE OP EESEABCH EFFCBT 

Ihe purpose of this thesis will be twofold: 

1. To demonstrate the usefulness of Exploratory Data 
Analysis (EDA) technigues in "preprocessing" large 
vclumes of data generally associated with any 
manpower analysis. This thesis will use a study of 
attrition of D.S. Army enlistees as the vehicle for 
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this demcnstxation. Ihe dependent variable under 
investigation is specified as "total active federal 
service" • This phase of the research will provide 
examples as tc how EDA techniques can assist manpower 
analysts and decision makers in determining prctlems 
in the data under analysis and in variable selection, 
(a discussion of exploratory data analysis techniques 
is found at Appendix A) 

2. Upon selection of suitaile predictor variables of 
attrition, an analysis of survival functions will be 
utilized to provide more detailed information. 
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II. DATA AMD METHOD OL OGY 



A. TEE EATA 

1 . CMDC C ohcr t file De scr i ption 

As stated, the data used to fulfil the purposes of 
this thesis was the FY79 COHORT file, maintained fcy the 
Defense Manpower Data Center (CMDC) at Monterey, California. 
This COHORT file is a longitudinal register of all acces- 
sions for a given year, updated at various predetermined 
times so as to allow for tracking of performance of that 
cohort in subsequent years. The FY79 cohort under investi- 
gation was last updated in September 1983. The file depicts 
each individual through 69 variables £Ref. 11]. FY79 was 
arbitrarily selected as a representative sample; it should 
be noted that the data from any given year may be confounded 
by political, social, and economic factors which are highly 
subjective and difficult to measure. 

2 • gre liminar v Investigation and Data Reductio n 

The data set was reduced based on a request for an 
investigation into ncn-high-school-graduate performance from 
the United States Army Recruiting Command (USARECJ , Fort 
Sheridan, Illinois. This request and subsequent telephonic 
requests for information suggested investigation into three 
military occupational skills (MOS) : specifically at least 

one MCS from each of the major subdivisions of the Army, 
namely Comhat Arms, Combat Support, and Combat Service 
Support. A histogram of the FY79 accessions (with non high 
school diploma graduate status) by MOS was developed 
(Appendix B) and subsequently the MOS’s rank ordered by 
numbers accessed. Eased on this ranking. Table I depicts 
the MCS's chosen for analysis. 
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TABLI I 

Bllitary Occcpational Skills To Be Analyzed 



Major Subgroup of Army 
Ccmbat Arms 

Ccnbat Support 
Combat Service Support 



MOS 

11B Infantryman 
1 1X Infantryman 
13B Artilleryman 

64C Hotor Transport 
Operator 

31H Multichannel Communica- 
tions Operator 

761 Supply Specialist 
94B Food Service 
Specialist 



In addition, the data set was further reduced to 
only ncn-prior-service male accessions, based again on 
conversations with OSAEEC. Of course, all education levels 
were included so as to be able to ultimately compare the 
effects of education level. A data reguest was provided to 
DMDC for the data described above; the final form of the 
data was in character form, stored in 5 files on the Mass 
Storage System of the Naval Postgraduate School computer 
system. 

3 . Pr e paration fcr Exp loratorv Data A nalysi s 

Eased on the above reduction of the data, the 69 
available variables cf interest were reduced to 1 4 variables 
to limit the scope of the investigation and to demonstrate 
the use of the Exploratory Data Analysis technigues. It 
should be noted that these procedures will be useful on any 
size data set for any number of variables subject only to 
the limitations of the storage capacity of the computer 
system in use. Table II provides a listing of this first 
selectic,n of variables. 
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TABU 11 

Initial Variables of Interest 



Expla natory V ari abl es Levels 

High School Level Obtained 13 
Current Fay Grade 31 
Marital Status (current) 2 
Nunher of Dependents (current) 9 
Character of Service 4 
Eeenlist lent Code 4 
Age at Entry 15 
High School Level at Entry 13 
Sex 2 
Race 3 
Ethnic Code 20 
Marital Status/No. of Dependents at Entry 20 
AICl Group (Mental Category) 3 



Depen den t V ariab le Levels 

Total Active Federal Service Number of months 



Total active Federal service was chosen as the 
dependent variable at this stage of the analysis to allow 
for investigation of possible associations with the abcve 
candidate predictor variables over time as opposed to a 
“go-nc-gc" binary representation of attrition. This depen- 
dent variable allows the decision maker to initially see the 
effects of the candidate predictor variables on different 
levels of attrition, whether the assesions contracted for 
three cr four years cf initial service. 

These variables having been selected, simple FOSTEAN 
and AEL programs (Appendix C and D) were written to retrieve 
the data from mass storage into an interactive environment 
for graphical analysis. 
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B- HITHCDOIOGI 



Exflcratory data analysis techniques are to be utilized 
to analyze the data described above. A draftsman's display 
[Hef. 12: pp. 136,145] is prepared to initially process the 

data. Association between variables of interest are deter- 
mined as well as any possible errors in the data. Upon 
necessary refinement, further Draftsman's displays are 
utilized to select possible explanatory variables. Bcxplot 
analysis is performed to analyze the distribution of the 
levels of the candidate explanatory variables and their 
contribution in determining length of service. A comparison 
of the statistics of each distribution is utilized to deter- 
mine relationships among the various levels of each of the 
candidate explanatory variables. Confirmatory analysis in 
the form of parametric and nonparametric hypothesis testing 
is presented to indicate the statistical significance of 
sample comparisons. 

Finally, a survivor function approach is utilized to 
analyze for probabilistic relationships. Failure times and 
survival times are identified that lead to calculations of 
the probability of attrition and reenlistment for both the 
three year enlistees (3Y0) and the four year enlistees (4Y0) 
from the FY79 COHOBT data. 
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III. ixploratobi data analysis 

A. INITIAL DBAFTSMAH'S DISPLAY 

As described in Appendix A, the draftsman's display 
[Bef. 12: pp. 136,1^15] is an efficient means of taking a 

"first glance" at a data set. In addition, the use of APL 
as a programming language fcr this analysis allows fcr rapid 
manipulaticn of large arrays in a user friendly fashion. A 
ten percent sample cf the data set consisting of the vari- 
ables listed in Table II above was run through the 
"draftsman" program [Bef. 13]. Sampling was performed by 
reading every tenth record of the data set provided by EMIC. 
The file was prepared in Social Security Account Number 
order so 10 percent sampling ensured that a country-wide 
sample was created. Also, the file being longitudinal, any 
length-biased sampling problems were avoided [Eef. 14: 
p.13]. The output produced was a 14 by 14 matrix cf two 
dimensional scatter plots. A segmented copy of the display 
is found at Appendix E. Refer to this Appendix for the 
following discussion. Note that the data has been 

"jittered" to reduce overlap of the data or data points with 
the same discrete values [Bef. 12: p.21j. (note: ceding of 

the variables is defined in [Ref. 11] ) 

First an overall view of the entire display is very 
useful to an analyst in several ways: 

1. Categorical data is rapidly identified by the 

"blocking effect" seen in most of the displayed vari- 
ables, e. g. , "Marital Status" vs. "MOS." This aspect 
of the display is critical in allowing the analyst to 
"see" the data and immediately determine where dummy 
variables, for example, may be necessary. 
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2. Also coding of the variables is displayed as in “Hs 

at entry " vs. "Total Service” One observes that 
the scale of education ranges from 0 to 13, with the 
najority of the data grouped at 4 and 13, which 
corresponds to 2 years of high school and a Graduate 
Equivalency Diploma (GED) , respectively. 

Fortunately, in this analysis, the file description 
containing the coding schedule was available; one can 
envision the csefulness of the graphical technique in 
"uncoding" data sets that may not have an accompa- 
nying description. The analyst could then recode as 
necessary with many of the commonly-used file 
macagement systems available. 

3. Errors in the data may be identified in a rapid and 
efficient manner. Again referring to "HS at entry" 
vs. "Total Service", the majority of the data is 
shown to be 2 year high school level (4) and GED 
(13) . Now the official request for data to DMDC was 
for all education levels. Because of a simple misun- 
derstanding and a misplaced operand in the code that 
extracted the data from the master cohort file, only 
NHSG data were provided. The use of the display, 
then allowed for the prevention of the costly mistake 
the unsuspecting analyst may have made in developing 
a model with erroneous data. This error-preventing 
aspect of this procedure manifested itself in 
subsequent displays as will be discussed later. 

4. Although further analysis was not performed, the 
display allows the analyst to determine 
multicollinearity/interaction effects of the concomi- 
tant variables. For example, "Age at entry" 
vs. "Number of dependents" plot may provide the 
impetus for further study, if either of the variables 
had a visual effect, on, say, total service. 
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Ih€ us€ of the entire display is simple and “intuitive”. 
It allows an analyst to bridge the gap, at least in seme 
fashion, between the quantitative world of the analyst and 
the “real” world of the decision maker through the power of 
the train’s visual correlation abilities. 

In the problem at hand, that is, the effect of various 
personal chairacteristics on performance (in the form of 
attrition or more generally, total service), analysis of the 
first column of the display is most revealing. again, this 
data set has been discovered to only contain NHSG and GED; 
the entire spectrum of education levels will be analyzed in 
a subsequent draftsman's display. The first column of the 
display depicts scatter plots of all independent variables 
versus total Federal service. To aid in the discussion, 
seven of the plots have been reproduced in Figure 3. 1 and 
the remaining seven in Figure 3.2 below. 

Viewing both figures, no rapidly discernible or 
“glaring” associations are evident. This is largely a 
result of the sheer size of the data set. 

However, much useful information is available: 

Figure 3.J, MCE Vers us Total S ervice : 

♦ This figure indicates that the largest number of 
assesions were HOS 11B (a “1“ in the DMDC coding) 
followed by 13B (2 on coding scale). No IIX's are 
discernible because this entry level "basic foot 
soldier” MOS was not created until 1980. MOS 64C 
seems to have a distinct break in length of 
service: this break indicated that perhaps further 

analysis is needed. One possible explanation may 
be that a portion of the accessions entered this 
MOS only for the training, and subsequently left 
the service at first opportunity through an admin- 
istrative discharge- MOS 76 Y (6 on DMDC scale) 

. seems to be the most successful in terms of 
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Figure 3,1 First CoIubo of Draftsman’s Display 

performance (i.e. , completion of at least 36 months 
of service) . 

2. Figure 3.^, AF£T Gro up Versus Total Servic e; 

• This variable combination is coded from 1 through 
8^ with 1 being a Category V. This lowest category 
corresponds to an AFQI total score of 9 or less. A 
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more 



slight trend is evident: as AFQT increases, 

data points are found in the higher total service 
range. This graphical analysis agrees with the 
literature presented in the introduction of this 
thesis. This also indicates that most of the 
accessions considered here (NHSG and GED) are found 
in categories 3,4, 5,6, which corresponds to mental 
categories IVB,IVA and IIIA, respectively. 
This leads to the conclusion, as expected, that 
NBSDG accession performance on the AFQT is in 
consonance with education level. 

F igure 3.1, Marital Status Versus Total Service : 

• This plot indicates that most NHSDG were single 
with no dependents (10 on the DMDC coding scheme). 
Again the mass of data points requires further 
sufcseguent analysis. The upper grouping depicts a 
slight increasing trend as accessions differ by 
number of dependents (20=married/no deps., 
21=married/1 dep. , etc.). This, along with ether 
aspects will be analyzed in more detail in section 
B, this chapter. 

4. F igure 3.1, Eth nic Code Versus Tota l Serv i ce ; 

• This plot indicates that a great preponderance of 
the accessions were "other” which corresponds to 
Caucasian in the DMDC coding. Puerto Ricans (cede 
4) indicate increased total service, leading to the 
conclusion that race is an important predictor of 
service and attrition. 

5- Fig ure 3.1, Race Ver sus Total Service : 

• This plot reinforces that of the ethnic code plct. 
Host assesiens were white, as indicated by the mass 
of data at cede 1. There is a slight increase of 
service indicated in the category 2 corresponding 
to blacks, and perhaps even a greater massing in 
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th€ 50-60 mcnth area of category 3, which ccrre- 
spcnds to race ’’other”. Again race seems to be a 
predictor of service or attrition. 

6- Figure 3.J, Sex Vers us Total S ervic e; 

• Although the data reguested from DMDC was for male 
accessions (coded 1), at least a few data points 
are evident in the category coded "2”, corres- 
ponding to sex "female". Subsequent analysis 
through an A£L program written to "scan" the data 
and count the frequency of certain data elements 
(Appendix D) indicated that in excess of 200 female 
NHSDG persons were accessed and recorded in this 
file. Again the draftsman’s display and more gener- 
ally the exploratory data analysis approach 
indicated erroneous data# perhaps preventing faulty 
analysis. 

F igure 3.J, High School at Entry Versus Tota l 

S ervice ; 

• Previously discussed above. This variable has seen 

widespread use as as a screening device in 
recruiting £Eef. 7: p« 1 ]• 

8. F igure 3,2, Ace at E ntry Versu s Total Service; 

• This variable combination indicates a wide range of 
values becatse of the more "continuous" nature of 
the age variable. Although most NHSDG accessions 
were in the 18-20 year category, this plot indi- 
cates a slight increase in total service as age 
increases to about 25. Then the plot may indicate 
that older accessions do not fare as well in the 
measure of performance chosen. This agrees with the 
literature; further investigation is deemed 
necessary by the display. 
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70TA1. SCRVICE 



Figure 3.2 First Colamn Continued 

F igure 3.2, Reenlist aen t Code Versu s Total Se rv ice; 

• This plot provides information that premature 
attritors and successful first- tour completers may 
receive the same reenlistment code. A "0" indi- 
cates unknown or no code, a ”1” eligibility to 
reenlist, a ’* *2" that a local bar to reenlistment 



27 



has been applied, a ”3" that a DA-ievel EAE has 
been applied, and a ”4” that the soldier is totally 
ineligible. The DMDC file description only 
provided 2 codes:- again the display has indicated 
erroneous data or perhaps changed codes sutseguent 
to latest file update. This display prompted a 
telephonic clarification to DMEC that may not have 
been made in the absence of exploratory data anal- 
ysis. Since both "failures” and "successes" can 
receive a cede of "eligible", this variable is not 
useful as a predictor of attrition. 

10. F igure 3.2, Chara c te r of Service Versus Total 

S ervice : 

• This variable combination, like reenlistment cede 
above, is an indicator of "after the fact" perform- 
ance, either premature attrition or successful 
first-term completion. Again, a "0" indicates 
unknown or uncoded data, a "1" an honorable 

discharge, "2" under honorable conditions, a "3" 
other than honorable conditions, and a "4" a 

dishonorable discharge. A massing of the data in 

the 0-20 month area of honorable discharges indi- 
cates that premature attritors were granted such a 
discharge through perhaps the Trainee Discharge 
Pregram or Expeditious Discharge Program. Thus 
this variable combination is not a good predictor 
of performance since discharge award in seme cases 
is based on the local commander’s decision. Quite 
often the honorable discharge may have been granted 
to speed tie discharge of a substandard soldier; 
Under Honorable Conditions discharges, Other Than 
Honorable discharges and Dishonorable discharges 
require more "red tape" and administrative delays. 
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F igure 3,2, Number of Dependents Versus lo tal 

Service ; 

• This plot is an updated data entry that allows for 
the tracking of the addition of dependents through 
the serviceman's tenure. The plot indicates a 
majority of the service people considered through 
September 1983 still had zero dependents (coded 
”1”). As number of dependents increase, total 
service seems to increase, although there is a 
proportion cf data points with more dependents that 
failed to ccmplete 36 months of service. This plot 
indicates that with further analysis, number of 
dependents may be a predictor of total service and 
attrition. 

12. F igure 3-2, M arital Status Versu s Total Service ; 

• This variable is an updated variable indicating 
current marital status of the service member, a ”2” 
for married and a ”1” for other [Eef. 11]. The 
plot indicates a majority of the service members 
are still single. Married service members' seem to 
demonstrate an increase in total service, indi- 
cating that this variable is a candidate predictor 
variable reguiring further analysis. Again , this 
agrees with the literature. 

13. Fig ure 3. 2, Pay Grade Versus Total S ervice : 

• Although pay grade is not really a relevant means 

of predicting performance of accessions, this plot 
gives a gocd indication of the "intuitiveness” of 
the graphical depiction of the data. This plot 
indicates an "ideal" upward trend: as months of 

total service increase, the mass of data moves 

upward and to the right. High performers are those 
that receive promotions earlier than the mass in 

.that particular pay grade, for example, grade E5 
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indicated bj a 5 on the vertical axis of the plot 
shows that the majority of the service members were 
promoted in the 48-60 month time frame, which is 
the norm. However , observe the points in the 38-40 
month area, indicating waiver or early promotion. 
From this plot, one can ascertain some idea of the 
number of achievers in this cohort. Along the same 
lines, the poor performers are evident in, for 
example, tie El grade in the 50 month time frame. 
The entire plot seems to indicate that there is the 
expected upward motility of the average soldier. 

Figure 3.2, High Schoo l Obtained Versus lotal 

Serv ice ; 

• Although massive data size obscures the plot, this 
plot does indicate the some of the NHSG accessions 
have completed their GED requirements (a”6" on the 
scale) during this time frame. Of these that have 
completed, a slightly increasing trend is evident, 
again reinforcing the literature that education 
level is a suitable independent or predictor 
variable on attrition. 

In summary this first draftsman's display has demon- 
strated that the graphical (EDA) procedures are critical in 
identifying erroneous data, determining variables of 
interest, and identifying multicollinearity/interaction 
effects. Now a more refined version, with an even further 
reduced list of candidate variables and in some cases, with 
variables that have been recoded so as to be more intui- 
tively appealing was produced. This display is in segmented 
form at Appendix ?. This version of the display was devel- 
oped from the more general data set of FY79 accessions, this 
time including all education levels. 
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B. BI7ISED DEAFTSMAH'S DISEIAY 



The initial explciatory analysis above revealed that 

1, the data cn hand was not suitable in that it did not 
contain High School Diploma Graduates (HSDG) for use 
in comparing any effects of education on performance; 

2. the list of variables under investigation could be 
further reduced. 

The purpose of this second iteration of draftsman’s analysis 
was tc serve as a final check on the data and to determine 
any other relevant information from the data prior to a more 
detailed investigation utilizing other methods. 



TABLE III 

Reduced Variable List for Further Analysis 



D epend ent 7 ari able 

Total Service 



Eaplana torv V ariable 

Military Occupational Skill (MOS) 
Marital Status/Numter of Dependents 
Bace 
S € X 

Level of Education at Entry 
Age at Entry 
Mental Category 



G ene ral Perf ormance Indicators 

Be enlistment Code 
Character of Service 



The data shortfall in education level alluded to above 
was sclved in the submission of a request to DMDC for a mere 
complete data set. This second data set was received and 
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again stored in 5 files in the Mass Storage System of the 
NPS Computer System. The FY79 COHOBT file consisted of 
30778 records of data. Files for the FY80-FY83 COHOEIS were 
also acquired for later model validation and sufcsecuent 
research. It should he noted that the target data requested 
was to be non- prior- service (NPS) male enlistees. The vari- 
ables under investigation for this phase are listed in Table 
III . The last two variables in the table were/ as previ- 
ously stated/ not to be considered as predictors tut as a 
means of assessing general performance of the enlistees. 

Again, FORTRAN and APL programs (Appendices C/E) were 
utilized to retrieve the data from mass storage and to 
manipulate it into form for interactive analysis. A ten 
percent sample of the data was taken for analysis (3C78 
records) . The data was again jittered to reduce overlap. 

An overall view of the draftsman's display (Appendix F ) 
demonstrated the following: 

1. The new data set is mostly categorical as expected. 

2. All levels of education have been included as demon- 
strated by the "HS at Entry vs. Total Active Service" 
plot. 

3. The total service scale on all plots extends to 160 
months, indicating that at least some prior service 
enlistees have been erroneously included in the data. 

4. Some female eilistees have been included as indicated 

by the "sex vs. Total Service" plot, again 

demonstrating erroneous data. 

Most of the discussion below centers on the first column 
of the display; hence this column has been reproduced in 
Figure 3.3 and Figure 3.4 Viewing both figures simultane- 
ously, the massing of the data points in heavily concen- 
trated "blocks" demonstrates the large number of data points 
in the sample. The dimensionality of the problem is graphi- 
cally e.vident. Some specific information that can be 
gleaned from this display follows. 
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Figure 3,3 First Column of Revised Draftsman's Display 

Figure 3.3, Kilitarv Occupational Skill ys. lotal 

S ervice ; 

• The 11B and 13B occupational skills (1 and 3 in the 

DMEC coding) appear to have attracted the most 
enlistees. No discernible trend or association is 
apparent in the length of service for these two 
combat arms occupational skills. Skill 64C (coded 
4) demonstrates the "break” in length of service 
just as in tie non-high-school-graduate data in the 
previous section. Occupational skill 76Y, (6 on 

the DMDC scale) as before, appeared most successful 
with less soldiers attriting within 36 months of 
service, or the normal first tour length. 

2- F igure 3.3, M ental C ate gory vs . Length of Se rvic e; 

• This variable corresponds to "AFQT group" in the 
, original display, and has been renamed based on the 
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information seen in tie first draftsman’s display. 
This variable has also been recoded to reflect a 1 
for mental categories 5 and 4c, a 2 for mental 
categories 41 and 4a, a 3 for categories 31 and 3a, 
and a 4 for categories 2 and 1. This plot demon- 
strates the recruiting policy of targeting the 
higher mental categories by the massing of the data 
in those respective areas. 

3. F igure 3,3, Marital Status at Entry vs . Total 

Service : 

• This variable is coded with a 10 for single with no 
dependents through 19 for single with 8 dependents, 
and then a 20 for married, no dependents, through 
29, for married 8 dependents. Most enlistees in 
this data file were single with no dependents. The 
married enlistees (20 and higher on the DMEC scale) 
indicated a slightly increased total service, 
particularly as numter of dependents increased 
(21,22 and 23 on the scale) 

4. F igure 3.3, Race vs. Total Service ; 

• The addition of the high-school-diploma graduates 

has not affected the pattern that was evident in 
the original draftsman’s display: a preponderance 

of whites (ceded 1) followed by blacks (ceded 2), 
and others (coded 3) is still evident. The "ether” 
category still demonstrates increasing length of 
service. 

Figure 3.3, Sex vs. S ervic e; 

• As previously stated, this plot demonstrates that 
the supposed all male non- prior-service file has 
included in it a number of female enlistees. Also 
note that the massing of the data indicates that a 
large proportion of these females attrite with less 
than twenty months service. 
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Figure 3.4 First Column, Continued 

6. Fig ure 3.4, Hiah Sch ool at Entry vs . T otal S ervice : 

• The inclusion of all levels of education can new be 
verified. The plot has been recoded utilizirg an 

API program (See Appendix D) as shown in Table IV 

below: The massing of the data at position 5 indi- 

cates that most enlistees in this sample were high 
school diploma graduates. Also, enlistees with two 
to four years high school outnumber those with 
equivalency status. No discernible differences can 
be observed in the equivalency certificate holders 
over the other levels of education due to the 
massing of the data. Note the distinct break, 
though, in the GED length of service. A grouping 

is evident for zero to twenty months of service. 
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lABlE IV 

Recoding of Education Levels for Draftsman’s 

Display 



Dp to 1 year of high school 
2 2 years nigh school 

3.. ... 3-4 years higii school, no diploma 

4.. ... Graduate Eguivalency Diploma 

5 Eigh School Diploma Graauate 

6 At least 1 year college and higher 



and another for approximately thirty tc forty 
months. This may be partially explained by viewing 
age at entry versus high school at entry along the 
same row as high school versus total service in 
Figure 3.5. This same break is evident; the 





Figure 3.5 Bov of Display for Cross Comparison 

majority of the enlistees with the eguivalency 
certificate (GED) are 20 years old and below with a 
break below 20. Perhaps this break in age wien 
viewed with the break in service indicate that the 
younger GEE enlistee is not as successful in 
completing service, just as he was in not 
completing high school. 
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7- Fig ure 3.4, Ace at En try vs. Total Servic e ; 

• The massing cf the data indicates that most enlis- 
tees fall in the 17-21 category. As age increases, 
tctal service again seems to be divided into 
distinct branches. These branches could possibly 
he related tc marital status, number of dependents, 
or education level, where breaks of this nature 
were also evident. See the entire display to 
compare each of the above mentioned variable ccmti- 
nations versus length of service and against ctler 
possible ccmhinations of other variables for 
possible insights. 

Figure 3.4, Re enlistmen t C ode ( " Reup " code) vs. Total 

S ervice ; 

• This plot shews that a surprising number of enlis- 
tees, both premature ••leavers" and successful 
(i.e. , with more than 36 months service) "stayers", 
have an uncoded 0 reenlistment code. This pretahly 
indicates that the record has not been posted with 
this information. Code 1, corresponding to 
"eligible fer reenlistment", is massed after 36 
months, indicating that completion of at least the 
first term is a requirement for reenlistment eligi- 
bility. Cede 2, a local bar to reenlistment, is 
also massed around 36 to 40 months of service, 
indicating that the decision to allow reenlistment 
is often reserved for the end of a soldier’s term 
of service. It should be noted that the bar to 
reenlistment can be issued by the local cemmander 
at any time deemed necessary, yet it appears that 
exercising cf this powerful option is not being 
done. Code 3, the Department of the Army bar to 
reenlistment, indicates a uniform massing cf the 
data. This agrees with Department of the Army 
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pclicy to automatically bar soldiers that have 
received certain recognition as substandard through 
judicial, ncn- judicial, and administrative actions. 
Cross comparison with the next column cf the 
display (see Figure 3.6) reinforces this idea: 
This plot is reenlistment code versus character of 
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Figure 3.6 Segment for Cross-Comparison 

service awarded. Beenlistment code 3 has the 
majority of the character of service codes 2 and 3 
corresponding to "under honorable conditions” and 
"other than honorable " discharges respectively, 
both of which are considered substandard. 

Similarly, code 4 in reenlistment signifies 
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ineligibilitj for reenlistment, which is ncticeatly 
less of the data, contains the worst character of 
service code, a dishonorable discharge indicated by 
a 4 on the character of service axis. 

Figure 3.4, Characte r of S ervic e vs . Leng th of 
Service ; 

• This plot indicates that as length of service 
increases tc 38-40 months, character of service 
code decreases. The DMDC coding [Ref. 11] lists the 
coding as 1 for an honorable discharge, so one can 
infer that "stayers” generally receive honorable 
discharges. Those unsuccessful soldiers with less 
than 36 months service receive the majority of the 
"under honorable conditions", "other than honor- 
able" and "dishonorable" discharges, coded 2, 3, and 
4 respectively. 

This revised draftsman’s display analysis has provided 
some insights into those personal characteristics affecting 
length of service and hence attrition. All seven possible 
explanatory variables under consideration have demonstrated 
at least some effects on length of service. The effects of 
education level, mental category, marital status and number 
of dependents, and age have at least initially seen to be 
most profound. 

These displays have served to provide an initial graph- 
ical view of the data that is both intuitively pleasing and 
simple. In addition to identifying possible erroneous data 
entries and any peculiar coding of the variables, a major 
result of the analysis has been a reduction in the dimen- 
sionality of the data itself. The displays also aid in a 
general familiarization with the data under investigation. 
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C. ECIilOl ANALYSIS 



The boxplot will be utilized to demonstrate another 
Exploratory Data Analysis technique that provides a powerful 
means of obtaining more information about a data set. (See 
Appendix A for a discussion of boxplots in general.) 

1 . Edu cat ion Ley^ Ver sus Length of S ervice 

In order to demonstrate the effectiveness and power 
of boxplots, one of the six candidate explanatory vari- 
ables, high school education level at entry, will be 
investigated in detail. As stated in the introduction, the 
literature has pointed out that education level at time of 
entry has been a generally accepted predictor variable in 
attrition analyses during the last ten years. Therefore a 
boxplot analysis utilizing the category subpopulaticn anal- 
ysis program in the IBM experimental GRAFSTAT package is 
presented in the following discussion. Length of service, 
the chosen dependent variable for this portion of the study, 
is plotted versus education level at entry. 

The left panel of Figure 3.7 is a depiction of a ten 
per cent sample of FY79 enlistees as of September 1983. As 
previously stated, this data set is a smaller subset 
consisting of seven military occupational skills from the 
entire FY79 COHORT file from DMDC. The GRAFSTAT input 
screen for this program allows the analyst to subdivide this 
batch of data into its seven component occupational skills 
through the input of a simple "category" vector. (See 
Appendix G for a depiction of this input screen.) The y 
axis is length of total active service in months, while the 
X axis indicates education level at time of entry into the 
army. The non high school diploma graduate level is indi- 
cated by "NBSDG” and indicates three to four years of high 
school without a diploma. Education level decreases toward 
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Figure 3.7 Education Level vs. length of Service 



the origin# with a 4 indicating 2 years of high schccl# a 3 
indicating 1 year of high school# a 2 indicating 2 years of 
junior high school# aid a 1 indicating 1 year of junior high 
school. To the right of the position marked "NHSDG"# educa- 
tion level increases until the position marked "GEE" which 
indicates a graduate equivalency "diploma" or certificate. 
A 6 indicates high school diploma graduate# a 7 indicates 1 
year cf college# 8 fcr 2 years of college# 9 indicates 3 to 
4 years college hut nc degree# and a 10 indicates a college 
degree. Eleven and 12 indicate a masters and dcctcrate 
degree# respectively. 

Viewing the entire display in the left panel# the 
hoxplct provides a graphical statistical summary of the 
distribution of each of the suhcategories (i. e. # the levels 
of education) in a form for easy comparison. A table of 

values is also presented beneath the display. The mean of 

each subcategory is depicted by the dot (with lines 
connecting the subcategories). The median of each subcate- 
gory is depicted by the other dot in the body of the bcx. 
Adjacent values and their associated "whiskers"# or the 
lines drawn from the body of the box to the adjacent values 
depict tie tails of the distribution of each subcategory. 
Outliers are depicted by heavy dots and are defined as these 
values greater than 1.5 times the interquartile range of the 
distribution. The mean of the entire display across all 
subcategories is depicted by the arrowhead on the y axis. 

It is immediately apparent that this subpopulation 
contains some enlistees with prior service# indicated by the 
outlier values in tie left panel of figure 3.7 that have 
more than 60 months cf service. {If an enlistee entered the 
Army on 30 September 1978, max length of service without 
prior service is 60 aenths through 30 September 1983.) The 
data requested from EMDC was to be non-prior service (NES) 
only; again the important error-indication quality of this 
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technigue is readily apparent. The non-prior service enlis- 
tees can fce easily extracted from the data set in the 
interactive mode of this package; by the insertion of a 
simple truth statement (e.g., length of service <60 months) 
the data set is reduced to the desired set. (See Appendix G 
for the depiction of the input screen; selection is entered 
in the ”selec:tion” area.) This feature is also most useful 
in the comparison of certain other characteristics of the 
data as will be demonstrated. 

The ability to interactively subdivide the data 
according to other variables of interest is an extremely 
powerful tool, allowing the analyst to "compose” his areas 
of comparison and graphically determine associations in a 
multivariate sense. The actual data is never altered and 
there is no delay due to timely resubmissions of programs. 
In addition the graphical displays are rapidly available and 
intuitively appealing , requiring little explanation to 
those decision makers with less background in classical data 
analysis techniques. 

The right panel of Figure 3.7 indicates the results 
of removing the outlier enlistees with prior service. The 
scale of the boxplots is now larger so the mean, median, and 
spread can be readily ascertained. Note that the mean 
length of service has been only slightly modified by the 
removal of the 5 outliers ( determined by the "ALL" row of 
the two tables below the plots, 3078-3073) , as expected. 
Also note that the shape of the boxes before and after the 
removal of the outliers is the same, indicative of the 
resistance of the bcxplot. The right panel of the figure 
shows that performance in the form of length of service 
tends to increase as education level increases from the 
junior high level to high school diploma graduate status. 
College level enlistees and higher do not demonstrate this 
trend. Note that enlistees with the graduate equivalency 
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certiiicate demonstrate poorer performance than ncn-fcigh- 
school-diplcma gradcates (NHSDG) . The variance of the 
distribution of the enlistees with the GED is greater, 
perhaps due to the manner in which the GED may be awarded: 
Persons with any level of education can test for and gain 
the GEE, and numerous tests are available for the certifi- 
cate. Hence the large variance may be explained by the 
"variance" of the means of awarding the certificate. 

In the next figure, (Figure 3.8), the non-prior- 
service enlistees are again presented in the left panel as a 
"comparative other." In the right panel of the figure, all 
those soldiers who had an uncoded education level and those 
with PhD degrees have been deleted, again utilizing the 
selection feature of the GRAFSIAT program. This results in 
a "smccther" line through the means, allowing a more discer- 
nible view of the association of each level of education and 
length of service. Note from the table that only 2 persons 
were in these deleted categories, hence they are referred to 
as "cutlier" education levels in the title of the plct.' 
These points will remain deleted throughout the remainder of 
this particular analysis. 

In Figure 3-S, left panel, the previously discussed 
boxplct with "outlier" education levels deleted, ncn-pricr- 
service enlistees is again presented for further comparison. 
In the right panel, utilizing the selection capability, all 
those enlistees with any years of college have been deleted. 
Now the increasing trend is clearly evident. As education 
increases, length cf service increases. Again these 
soldiers with the graduate equivalency certificate (GED) 
demonstrate a lower mean performance than both the nen-high- 
school-graduates and the high school diploma graduates. To 
isolate this trend even further, GED soldiers are deleted in 
the right panel of Figure 3.10 Now, compared to the left 
panel of this display, the upward trend is most evident. 
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Figure 3.8 Education Level vs. Length of Service/II 
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Figure 3.9 Education Level, College Deleted 



The distributicn of each education level is graphi- 
cally described in the boxplot. For example, in Figure 3.10, 
the high-school-diplcra-gra duate distribution appears to be 
symmetric as indicated by the position of the mean and 
median relative to the ends cf the body of the box. Ihe 
distributicn appears to have a "thicker” tail in the loner 
length of service side, depicted by the longer whisker on 
the bcttcm cf the boxplot. Rapid comparisons of the distri- 
butions cf each of the subcategories can be done through the 
hoxplct analysis. 

Ihus the use cf boxplots has indicated that level of 
education dees have an effect on performance in the form of 
length of service. It should be reiterated that this phase 
of the analysis is exploratory in nature; the comparison of 
means could be strengthened through such confirmatory anal- 
ysis as a one-way ANCUA or in a non- parametric test such as 
the Kruskal-Hallis test of equality of means. An example of 
this confirmatory analysis will be provided in a later 
section cf this thesis- 

2. Ihe Multivariate Ca pabilitv of the Boxplo t 

Military occupation al skill of the enlistee will be 
examined for any indication of an association with perform- 
ance, and then further analyxed with regards to education 
level within the occupational skill. Seven military occupa- 
tional skills (MOS) were considered in this analysis. 
(Refer to Table I, previous chapter) . Again a ten percent 
sample of the data was utilized for analysis. 

Figure 3.11, left panel, provides the first lock at 
this suheategory of the data. Military occupational skill 
is plotted against total active service. Each military 
occupational skill is presented on the x-axis, beginning 
with combat arms (1 IB, 11X, 13B) , combat support (31M, 64C), 
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figure 3. 10 Education Level, GEO Deleted 



ajid ccmtat service sufport (76Y,94B). As before, the distri- 
tuticB of each military occupational skill with respect to 
total service is presented by the borplot. Again note the 
outliers in total service; these same 5 soldiers have been 
deleted in the right panel so as to expand the scale of the 
boxplcts for analysis. Through the graphical depiction of 
these distributions, the occupational skill most successful 
in terms of service can be readily determined. Here, skill 
76Y, supply specialist, demonstrates the greatest success 
followed by 11B, Infantryman. 

Ihe spread cf the distribution can be observed in 
the bcdy of the bcx; note that the deviation is also 
provided in the table below. The 11 X occupational skill is 
noticeably vacant. This MOS was created as an entry level 
of infantryman in FY80, hence this data set contains none of 
this general skill. Comparing the three branches of skills 
described above, combat arms was most successful in perform- 
ance as determined by viewing 1 1B and 13B boxplots collec- 
tively. Ccmbat service support was next in order of mcnths 
successful service followed by combat support. 

The shape of each of the distributions is readily 
apparent in the boxplcts. Each military occupational skill 
appears to be skewed toward the lower range of service as 
indicated by the postion of the median inside the bcdy of 
each boxplct. The variances appear relatively constant 
except for skill 64C. The utility of the boxplot in deter- 
mining homogeneity cf variance, for example, for subseguent 
regression analysis or for ANOVA assumptions is readily 
obvious. 

In Figure 3. 12, left panel, the previous figure of 
non-prior-service-only enlistees has been reproduced to 
allow ccmparisons. In order to isolate the effects of 
education level, all high- school-diploma graduates (and 60 
enlistees with at least some college) have been selected cut 
cf the subpcpulation, using the selection capability. Thus 
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Figure 3.11 Military Occupational Skill vs. length of Service 



in th€ right panel/ non-prior-service, high school diploma 
graduates have been plotted against total active service. 
Immediately note that a higher mean service is observed by 
these tetter educated enlistees. The variance of 

occupational skills has been reduced indicating that this 
subpopulation is not as great a risk when considering bow 
long each will serve. Those soldiers in skill 11 E demon- 
strate the highest success {mean 36.15 months). Eased on 
these graphical results high school diploma graduates 
perform tetter than the average non- prior-service enlistee, 
and more specifically, comhat arms skills rank first 
followed by combat support, and then combat service support. 
This may seem counterintuitive as technical skills required 
are thought to be higher in the combat service support 
tranches. 

Figure 3.13 repeats the same sort of analysis, this 
time comparing non-prior service enlistees total service in 
the left panel to non- prior-service enlistees that have not 
received a diploma (NHSG) and those that have received a 
graduate equivalency diploma (GED) . Note that "success” has 
fallen from 31.85 months to 29.08 months. Occupational 
skill 761 has the highest mean success, and combat service 
support now leads in total mean service. Also note that 
occupational skill 31H demonstrates a much higher variance 
when only these non-high- school graduates are compared to 
the total population. This could perhaps be explained by 
the lack of any real standards in how the GED is awarded. 
This certificate can be awarded at any level of education, 
provided that one of numerous tests has been passed. Also 
the non-high-school g^raduate with ,say, only a 10th grade 
level of education could account for extremely poor perform- 
ance in this somewhat technical microwave operator skill. 
Cn the other hand, those enlistees with an equivalency 
degree or 11- plus years of education from a "technically 
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MOS VS SERVICE, NPS ONLY MOS VS SERVICE. NPS ONLY.HSC AND HIGHER 
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Figure 3.12 Occupational Skill, High School Graduates 
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Figure 3.13 Occupational Skill, NflSG and GED 



oriented” school system could account for the high perform- 
ance. Here research could te performed in the area of 
"levels” of equivalency status and in a demographic analysis 
of those non high school graduates. 

A comparison of high school diploma graduates to 
non-high-school-diplcma graduates is presented in Figure 
3.14 . Those soldiers with equivalency certificates (GFD) 
have teen included in the non high school graduate category. 
In the left panel of this figure, the high school graduate 
category is presented for comparison. Again, this plot is 
non prior service only graduates. The right panel is non- 
prior-service, non-high-schocl-graduat es and equivalency 
certificate holders. The "tetter educated” enlistees ,on 
the average, have outperformed those with less than a school 
degree (as shown ty the mean values both in the boxplots and 
the tabled values helow the plot). Skill 76Y, supply 
specialist, in the right panel, indicates a tetter perform- 
ance, (mean of 33.6) than the same occupational skill with 
a high school diploma (mean 32.7) . The variance of the two 
distributions is roughly the same. Again personal charac- 
teristics and the technicality of the skill involved along 
with the effects of mental group distributions, enlistment 
tonus differences may explain this anomaly. 

Since a difference has been established in perform- 
ance based on the certificate status at entry, the tcxplot 
can be utilized to go even into further detail. In Figure 
3. 1 5 ,non-high-school-gradua tes only are compared tc high- 
school-diplcma graduates and in Figure 3.16 those with 
equivalency degrees are compared to the diploma-carrying 
soldiers. In each figure, the batch remains limited to 
non- prior-service enlistees, varying only education levels. 

Figure 3.15 demonstrates that the non-high-schocl- 
graduate performance , depicted in the right panel, was 
below diploma 'graduate performance. This is seen 
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graphicallj by comparing the relative position of the mean 
lines and in the tabled value below each plot. Going 
further, a compariscn of each individual occupational skill 
in the left panel to its counterpart in the right panel 
indicates that the varied educational level produces and 
entirely different distribution. This is observed through 
the location of the respective means and medians, the size 
of the body of the box, and the length of the”whiskers" or 
tails of the distribution. 

In Figure 3. 16, this same difference in the distri- 
bution of military occupational skills with respect to 
education level is again obvious. In every case, the high 
school diploma graduates outperform the soldiers with equiv- 
alency status. Again, GED holders exhibit a larger variance 
as indicated by the tody of the box, indicating a higher 
risk in attrition. 

The entire analysis presented on the effects of 
education level within military occupational skill is summa- 
rized in Figure 3.17 , where the baseline of ncn-prior 
service enlistees, categorized by occupational skill versus 
length of service is displayed simultaneously. Education 
level has teen selected in each plot. Education level, 
military occupational skill, and length of service have been 
integrated into a single display. Any other combination of 
variables such as marital status, age, race, could be 
further selected to provide more of a multivariate display. 
The lEA techniques ccmbined with the IBM GEAFSTAT package 
allows for any combination of covariates in an analysis, 
limited only by the imagination of the analyst. This 
display allows a rapid comparison of the effects of educa- 
tion level on performance (in the form of length of 
service) , perhaps providing a strong argument in favcr of 
these graphical methods for at least initial decisions 
regarding what level of education to recruit. Again, 



55 



confirmatory analysis is necessary for a more refined 
analysis of relative merits of alternative educational 
policies. 

3. Summary cf Bo rplot A nalysis of R emaining Va ri a bles 

Ihe remaining possible explanatory variables and the 
two "general performance indicators" were analyzed in the 
same manner as presented above. In each case the effects of 
education level were observed in a multivariate sense by the 
use of selection combined with the pairwise comparison of 
length of service and the other candidate explanatory vari- 
ables. Ihe actual bcxplots are found in Appendix H. Again, 
confirmatory analysis should be performed to verify the 
statistical significance of any conclusions drawn from this 
exploratory analysis. A summary of this analysis is 
presented below: 

Ment al Categ ory Versus Lengt h of S erv ice; (See 

Figures H.1 through H.9, Appendix H) 

• As mental category increases, length of service 
increases. Category 4c outperform Categories 
4b, 4a, 3b. 

• High school diploma graduates outperform the ncn- 
high-school-graduates and equivalency certificate 
holders (GEC) in all categories. 

• Nc category 1 soldiers were observed in the ncn- 
high-school-gradua te or GED categories, 

• Non-high-schcol-gr aduates outperformed the equiva- 
lency certificate (GED) holders in every category. 

• Variance in the non-high-school graduate and GED 
performance is higher than that of the diplcma 
graduates, indicating higher risk in attrition. 
Equivalency certificate (GED) holders’ variance was 
observed to be higher than non-high-schocl- 
graduates, again indicating a higher risk. 
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.14 HSDG, NHSG and GED by Occupational Skill 
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Figure 3.15 Non High School Grads vs. Diploma Grads 




w. 



MOS VS SERVICf. NPS ONLY.HSC ANO HIGHER MOS VS SERVICE. NPS ONL'r.CEO ONLY 





<0 ^ 

A ^ ^ *4 



«r» 



O ^ »» «A « 







5>-» 



5 

ar 



'j* 

a 



i 



O 

w 

< 

o 



^ ♦ o ^ 

• «>i Q ^ irt 

*> ^ 




tf> le ^ ^ 

— • — » 
• ♦ ^ M w «Vt 



— «o 






I 

I 

I 



I 



59 



Figure 3.16 Eguivaleocy Status vs, Eiploma Grads 
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Figure 3.17 Becap of Effects of Education on Performance 



2 . 



Ma ri tal Status and Number of D ependent s vs. Len^^Ui of 
S ervice : (See Figures fl.lO - H. 15, Appendix H) 

• Most enlistees were single with no dependents at 
tine of entry (92%) 

• Single enlistees with 1 dependent had highest mean 
performance. 

• In general, married enlistees outperformed single 
enlis te es. 

• As number of dependents in married soldiers at time 
of entry increased, sc did performance, (note: true 
up to 3 dependents.) 

• High-school-diploma graduates led in performance, 
followed by non-high-school-graduates, then 6ED 
holders. 

3- A ge Versus Le n gt h of Service: (See Figures H.16 

through H.24, Appendix H) 

• As age increases (at time of entry) from 17 to 19 
years, tctal service (performance) increases, 
followed by a leveling off in the 19 to 24 year 
range. Performance decreases as age incxeases from 
24 to 29 years old. 

• High-school-diploma graduates outperformed ctler 
less educated entrants, non-high-school graduates 
followed, then equivalency certificate holders. 

Sex Versus Length of S ervice : (See Figures H.25 

through H.30, Appendix H) 

• Males outperformed females. 

• Diploma graduates (HSDG) outperformed all ether 
education levels; Non- high-school grads outper- 
formed the GID holders. 

• Non-high-school-graduate females outperformed male 

non-high-schcol-graduates and female high-schcol 
diploma graduates. However, non-h igh-schocl- 

graduate females displayed the greatest variance in 
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length of service, indicating a possible higher 
risk in attrition. 

• Non-high-schcol-graduate females were followed by 
GED males and then NHSG males in variance in length 
of service (risk). 

5. Bace Ver sus l ength of Se rvic e; (See Figures H.31 

through H.36, Appendix H) 

• In general, the "other" category were the highest 
performers, followed by blacks, and then whites. 

• High- school-diploma graduates were the highest 
performers in terms of length of service in all 
categories. 

• Non-higb-school-graduates outperformed the graduate 
eguivalency certificate holders (GED). 

• GED blacks displayed the highest variance in length 
of service, indicating a higher risk in attrition, 
followed by GED whites. 

6. Beenlistment Code Versus Education leve l: (See 

Figures H.37 through H. 39, Appendix H) 

• At least through the high school diploma graduate 
level of education, as education increased, reen- 
listment eligibility increased. 

• GED soldiers received about the same reenlistment 
codes as those soldiers who entered the army with 2 
years of high school. 

• A wide variance in reenlistment eligibility was 
observed, pcssibly related to the fact that reen- 
listment eligibility is mostly up to the local 
ccmmander’s discretion. 

7- Been listment Code Versus L eng th of S ervi ce: (See 

Figures H.39 through H.44, Appendix H) 

• Approximately 30S of the sample was uncoded at the 
time of the last file update, indicating that the 

, bar to reenlistment may not used as often as it 
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could be as a rehabilitative tool for substandard 
soldiers. 

• Generally, as length of service increased, sc did 
the number cf soldiers eligible to reenlist. 

• Approximately 50% of the GED holders and of the 
non-high-schcol-gr adua tes indicated a reenlistment 
code of 3 , corresponding to only 30% in the hirgh- 
school-diplcaa graduates. This code corresponds 
to a Department of the Army initiated bar to reen- 
listment. This proportion seems a hit unreasonable 
and more research seems necessary in this multivar- 
iate combination. 

• High-school-diploma graduates outperformed all 
others, followed by non- high-school-graduates and 
the graduate equivalency certificate holders. 

• Again, the GED holders demonstrated the largest 
variance (risk) in length of service, followed by 
non-hig h-schcol graduates. 

8* Char act er of S ervice Versus E ducation Level: (See 

Figures' H.45 through H.47, Appendix H) 

• Generally, as education level increased (at time of 
entry) through high school diploma, so did the 
character of service awarded. 

• GEE and non-tigh-s chool-graduate soldiers received 
about the same treatment in character of service 
awarded. Again, high-school-diploma graduates 
received the largest proportion of the honorable 
discharges. 

• College graduates outperformed the high-school- 
diploma grads; however those entrants with one and 
two years of college were outperformed in terms of 
character of service awarded by the non-high school 
diploma graduates. 
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• A wide variance was noted possibly because of the 
ccimanders* discretion allowed in determining what 
character cf service will be awarded tc an 
individual. 

9. C haract er of Se rvice Versus length of Ser v ice ; (See 

Figures H.48 through H. 54, Appendix H) 

• As length of service increased so did the number of 
honorable discharges awarded. 

• Over half cf the cohort received an honorable 
discharge. A large proportion of early "leavers” 
also received this honorable discharge, perhaps 
indicating that some commanders are very lenient in 
their determination of type discharge to be 
awarded. 

• High- school- dip lorn a graduates receiving honorable 
discharges performed the best in terms of length of 
service, again followed by non-high-schocl gradu- 
ates and the GED's. 

• A wider variance in length of service was noted for 

the non- high-school graduates and the GED*s who 
received honorable discharges, when compared to the 
high-school-diploma graduates. This perhaps 

relates again to the discretion that is exercised 
by the local commanders in awarding discharges. It 
seems that length of service may not be considered 
as an indicator of "good" service by a number of 
commanders in the field. 

C. ElAflflE COHFIBHAICEY ANALYSIS 

Ccnclusions drawn from the above explanatory data anal- 
ysis must be analyzed in a formal manner to determine if 
differences, say, in the mean performance among the varying 
levels of education at entry is statistically significant. 
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An exanfle is presented below; this type of analysis should 
te performed on all conclusions reached before any policy 
implementation. 

Ihe cne-way- analysis of variance provides a well- 
structured approach in testing the equality of means amcng k 
sample populations. In this approach, the k populations are 
assumed 

1. tc be i.i.d ncrmal populations, and 

2. tc have equal variances. 

As has teen pointed out, some of the boxplot analyses 
above has indicated that the equal variance assumption is 
obviously net true; contrarily, this difference in variance 
was was used as an indicator of the '»risk” involved in 
recruiting that entrant with his particular qualifications. 
In those cases, ncnparametric tests are available for 
confirmatory analysis. An example of that type will also be 
presented below. See [ Bef- 15: pp. 492-503] for a 

discussion on 1- way AKOVA. 

Ihe actual calculations of the ANOVA table were 
completed using an AEI program contained in public domain of 
the Naval Postgraduate School computer system (Library 
5,OA3660). A copy of the program is at Appendix I. Results 
of the analysis are summarized in Table V Thus the null 
hypothesis can be rejected at the .05 level of significance. 

In instances where the homogeneity of variance and 
normality of population assumptions are infeasible, nonpar- 
ametric tests can be utilized for confirmation of 
statistical significance. The Kr uskal-Hallis test was 
utilized tc test the following hypothesis: 

H0;mean services across all education levels are equal. 

B1: for at least one pair of the population represented 
by the levels of education, the means are different. 
See [Bef. 16: pp. 229-237] for a discussion cf the K-W 

test. 



65 



TABLE ? 

Ancva Table for Testing Equality of Heans 



2 ^ A H O V A M A T R 



ANOV A WAS 


U R jD A T E D 


1/3/79 y ANOVAHOW FOR 


C H A G E S ^ 






A (J O V A TABLE 






SOURCE 


r*F 


ss 


M5 


F 


T R £ A T M E T 4 


25933.42 


6433.35 


21.67 


£ R R O P: 


2998 


896813.33 


299.14 




TOTAL 


3002 


922746.75 






R-SOU ARE 


= 0.028 








OVERALL MEAN = 


31.93 






T F: E A T M £ ?-rr 


EFFECTS 


~4.76 2.69 " 


1.29 "3.24 


"12.73 



The actual calculations were performed utilizing the 
EMDP statistical scftware program P3S on the Naval 
Eostgiaduate School Computer System. See [Hef. 17: 
pp. 442-^l<J3 ] for a description cf this package. Results are 
summarized in Table VI . 



TABLE TI 

Besults of K-B Test of Equality of Heans 
Variable 1 LOS 

Group Frequency Rank Sum 

No. Name 



1 


SOPH 


10 


9064. 0 


2 


JR 


752 


1017353.5 




GED 


196 


250661.0 


4 


KHSG 


504 


730567.5 


c 


HSDG 


1546 


2517723.0 



Kruskal-Nallis Test Statistic = 75.13 
Level of Significance = 0.0000 

Using Chi-Square Cistributicn with 4 Degrees of Freedom 
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Ihus the null hypothesis can be rejected at the .05 
level of significance. Multiple comparisons may be calcu- 
lated in accordance with [Ref. 16: p.231] to determine 

which pairs of means are different if desired. 

E. SUHMAEY OF EXPLOEATORI DlTl ANALYSIS EFFORTS 
1 . G enera l 

Exploratory Eata Analysis technigues have been 
utilized in the forms of draftsman's displays and tcxplots 
to "preprocess" a large data set. This analysis was 
presented as a demonstration of the power of EDA technigues 
and tc provide initial insights into the attrition cf O.S. 
Army enlistees prior tc further analysis. These technigues 
have provided the following: 

1. Familiarity with data set. 

2. Reduction in the dimensionality of the data set; 
numerous variables were determined as having no 
appreciable effects on the dependent variable under 
consideration. 

3. Identification of erroneous data. 

4. Structure of tie data set and variable coding. 

5. Information on multivariate and pairwise associations 
among the variables. This information will be 
summarized below. 

6. An intuitively pleasing form of analysis tc assist 
analysts and decision makers in understanding the 
problem at hand. 

7. A means of allowing the analyst to "compose” his 
method of attack on a large problem in an interactive 
fashion. 
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2. EE A and O.S. A rm y E nlisted Attrition 

Hore specifically, in the investigation of attrition 
of O.S, Arny FY79 etlistees, the following information has 
keen revealed; 

1. The variables listed in Table Vll below were observed 
tc have some effect on a soldiers performance which 
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TABLE VII 

Possible Explanatory Variables from EDA 



Age 

Sex 

Bace 

Mental Category 

Marital Status/Number of Dependents (at entry) 

Education level 

Military Occupational Skill 



has been defined as his total length of service. 

2. Level of education at entry has an effect on the 
performance of enlistees. Education level seems to 
interact with the other variables listed in Table VII 
above, prodccing different levels of performance. 
Other insights provided by the analysis ^re 

• High school diploma graduates demonstrated better 
performance than did non-high-school-diploma gradu- 
ates and those enlistees who had obtained a 
graduate equivalency certificate prior to entry. 

• Non-high school diploma graduates demonstrated 
better performance than did GED holders. 

• GED holders demonstrated a larger variance in total 
service obtained, indicating a higher risk in 
attrition than did non-high school diploma 
graduates and high school diploma graduates. 
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3 . 



Character of Service and Seenlistment Code are tvo 
indicators of performance that may also be affected 
by education level at entry; however, the discretion 
exercised by local commanders in awarding these may 
have confounded them as suitable explanatory 
variables. 

4. Confirmatory analysis needs to be performed after 
exploratory data analysis prior to forming final 
ccnclusion for policy implementation. An example was 
provided. 

3 . l imitations 

The EIA techniques are not to he used in lieu of more clas- 
sical statistical analysis; they are to be used in 
conjunction with them. Some limitations are 

1. Acceptance of their use by other statisticians. 

2. Package utilixed in this thesis is in the experi- 
mental stage; others maj not be readily available to 
the analyst. Since this package is experimental, 
certain capabilities are still being developed. Por 
example, in the the ANCVA presented, the means used 
in the analysis were not stored in a global sense for 
further analysis and had to be entered "by hand" into 
an ANOVA program. 

3. Ccst of graphics capabilities for computer systems. 

4. Storage necessary for EDA packages is currently not 
available for most personal or desk top computers. 

Thus Exploratory Data Analysis has been shown to be 
a useful technique in the initial analysis of data. In the 
next chapters, a more formal analysis will be presented. 
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I¥. SC BYIVOB FONCT ION ANA LYSIS 



A. EACKGfiOONO 

Ihe Exploratory Data Aralysis techniques presented 
(along with the necessary ccnf irmatory analysis) have shewn 
that level of education, sex, race, age, mental category, 
and marital status/nunber of dependents are candidate expla- 
natorj variables in determining the total length of service 
for an enlistee, 

A survivor function approach is utilized to gain further , 
insight into these explanatory and their relationship to 
length of service. 

Suppose that the length of service of an enlistee is a 
random variable Y. Ihe cumulative distribution function or 
c.d.f., then, can be viewed as giving the probability that 
an enlistee will "die" or "fail” or leave the army before x 
units of time, a realization of the random variable X, have 
elapsed. Then the quantity 

Six) = 1-F(x) = P(X>x) (eqn 4,1) 

called the survivor function, provides the probability that 
an enlistee "survives” more than x units of time. The 
survivor function, for discrete data, is a step function, 
where the height of the jump between any two values of x is 
equal to E (X=x) . Ihe survivor function is estimated by 
using the following relative frequency definition of 
probability ; 

S (X) = 1-F(x) = (number of observations > x)/n (eqn 4,2) 
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where n is the nuaber ic the sample being considered 
[Eef. 18: pp. 92-93, 263-264]. 



B. AEEIICA7I0N 

Ihe survivor function is therefore a logical means of 
analyzing enlistee behavior with regards to length of 
service. lor this portion of the analysis, attrition will 

te defined as failure to complete the first term of service. 
Ihe Fy79 COHORT consists of three-year obligated enlistees 
(310) and fcur-year ctli gated enlistees (4Y0) . Each subset 
will te analyzed separately. Ihe reenlistment decision will 
be defined as ccmpleting greater than one term of service. 
Bence, based on the survival function model, the fcllcwing 
equations indicate the "life" cycle of the enlistee: 



TABLE VIII 

Enlistee Life Cycle Models 



3 5 ear E nlistees 

E (enlistee will attrite) = P(X<36 mos.) 

= 1-P(X>36 mos.) = 1-S(x) 

E(enlistee will complete 1 term) = P(X=36 mos.) 

= Hr. of jump at x=36 

P (enlistee will reenlist) = ? (X>36 mos.) = 1-S (x) 



4 Jear E nlistees 

E (enlistee will attrite) = P(X<48 mos.) 

= 1-P(x 5>46 mos.) = 1-S(x) 

E (enlistee will complete 1 term) = P(X=48 mos.) 

= Hr. of jump at x=4o 

E (enlistee will reenlist) = P(X>48 mos.) = 1-S (x) 
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These realization times for the random variable X are 
chosen *ust to demonstrate the survivor function method- 
ology. Considerabde research has been done in defining the 
exact tire cutoff for the definition of attrition, since the 
first "term” of service may actually be thirty-four months 
as opposed to thirty-six because of the various "early out" 
programs offered by different commands. This analysis does 
allow for at least this concept in its definition of attri- 
tion as strictly less than thirty-six and forty-eight months 
(i.e. thirty-five and forty-seven months) length of service 
for the three year and four year obligations respectively. 
The methodology suffices no matter where the attrition defi- 
nition is made in the length of service parameter. 

Survivor functions for each of the previously defined 
candidate explanatory variables are estimated utilizing the 
Cumulative Distribution input screen in the IBM GBAPSTAT 
data analysis package. (See Appendix J for a depiction of 
this screen.) These functions are analyzed for the above 
mentioned statistics. 

The survivor function of the entire sample across all 
variables is presented to demonstrate the characteristics of 
the analysis in Figure 4.1 . Using this type of analysis, 
statistics for the most prevalent enlistee are presented in 
Figure 4.2 and in Figure 4. 3 for 3 year enlistees and 4 year 
enlistees respectively. The "most prevalent enlistee" was 
determined by observing the total number in each of the 
seven variables being considered. Note that the four year 
obligated enlistees demonstrated higher probability of 
attrition (0.33 to 0.24), and a lower probability of reen- 
listment (0.12 to 0.37) than the three year obligated 
enlistee, ceteris paribus. This may indicate that the 
utilization of the three year term of service is more cost 
effective than the four year tern, considering 
"assess, dress, train" cost described earlier. Results for 
the most prevalent enlistee are summarized in Table IX . 
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SURVIVOR FUNCTION. N-2565 




SAMPLE GRAPHICAL CALCUUTIONS 

P(LENGTH OF SERVlCE«36MONTHS) 

-HEIGHT OF JUMP AT LOS-36 MONTHS 
0.559-0.320=0.239 

P(LENGTH OF SERVICE<36MONTHS) 

-VERTICAL DROP IN CURVE FROM 0 TO 35 MONTHS 
1.0-0.559-0.441 

P(L£NGTH OF SERV1CE>36M0NTHS) 

-VERTICAL DROP IN CURVE FROM 37 TO 59 MONTHS 
0.321-0.000=0.321 



Figure 4.1 Survivor Function For All 3 YO Enlistees 

Ihe effects of education level on attrition of three- 
year-ctligated enlistees are presented in Figure 4.4 
Those exlistees with a graduate equivalency certificate 
indicate a higher pretability of attrition (0.54) than both 
those with two years of high school and those with non-high- 
schocl-diplcma graduate (NHSDG) status, having three to four 
years of high school. These findings reinforce the earlier 
toxplot analysis of length of service. The trend is also 
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IXNW or SERVICE rOR MOST PREVALENT JYO ENUSTEE 

THIS BATCH CONSISTS OF THE MOST PREVALENT 
ENLISTEES: 18 YR OLD. COMBAT ARMS.3YO. SINGLE 

WITH NO DEPS, WHITE MALE 



P(L0S<36)=0.242 
P(LOS=36)=0.385 
P(L0S>36)=0.373 
AVERAGE LOS=35.529 



Figure 4.2 Survivor Function, Most Prevalent 310 Enlistee 

evident in the probability of reenlistment with the high- 
school-diplcma graduates having the highest, followed by the 
NHSDG, then those with 2 years high school and then those 
enlistees with equivalency certificates. Again the GED 
enlistee is seen tc be inferior to the non high school 
diploma graduat,e (NHSEG). Results of this survivor function 
analysis are in Table X in the following section. 
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'SURVIVOR FUNCTION. N*33 




LENGTH OF SERVICE FOR MOST PREVALENT 4YO ENUSTEE 



THIS BATCH CONSISTS OF THE MOST PREVALENT 4Y0 
ENLISTEE; 18 YR OLD COMBAT ARMS. SINGLE WITH 
NO DEPS.WHITE.MALE 

P(LOS<48)=0.334 
P^0S=48)=0.545 
P(LOS>48)=0.121 
AVERAGE L0S=40.939 



Figure 4.3 


Survivor Function^ Most 


Prevalent 410 Enlistee 




TABLE 11 




Besults of Survivor Analysis on Oost Prevalent 

Enlistee 


Term 


P(attrite) P(full term) 


P(Eeenlist) Ave LOS 


3 years 

4 years 


0.242 0.365 

0.334 0.545 


0.373 35.53 

0.121 40.94 
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Figure 4.4 Survivor Functions of Education Levels 
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M M40 M 

LENGTH or SERVICe IN MONTHS FOR >H$OC.SYO 



C. BESOIIS OF SOBVIT/CB FONCTICN ANALISIS 



Similar analyses were performed on the remaining six 
candidate explanatory wariables for loth the three year 
enlistees and for the four year enlistees. Actual survivor 
functions are found in Appendix K and L respectively. 
Tabular summaries are provided in Table X and XI , Based on 
these summary tables, the lowest risk attributes in terms of 
lowest probability of attrition are provided in Table XII , 
Those attributes that result in the highest probability of 
reenlistment are shown in Table XIV and Table XV . 

The usefulness and simplicity of this approach is 
appealing for "guick and dirty" analyses in determining 
short term policy decisions concerning the characteristics 
desired for prospective soldiers. 

Modeling of the survivor curves can provide further 
prediction capabilities and significance of each of the 
covariates. However , the survivor functions presented all 
demonstrate the large "jump" at thirty-six and forty-eight 
months for the three year and four year enlis- 

tees, respectively. This large "discontinuity" causes 
modelling of the survivor function to be somewhat difficult. 
Modelling of the survivor is discussed in great detail in 
[Ref. 19] and [Ref. 20]. The noted similarity in the 
survivor carves for all the variables under investigation 
suggest that the use cf a Cox proportional-hazards model may 
be appropriate; however more research is needed in the 
modeling of the discrete jump in the survivor curve. See 
£Ref« 20]. This analysis has been presented as an initial 
methcdolcgy to demonstrate its usefulness ; no modelling 
will be performed. 
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B€SUltS of 
V ariatle 


TABIE X 

Survivor Function Analysis for 3X0 
P(ICS<36) P (LOS=36) P (LOS>36) Ave. LCS 


All variables 


0.441 


0.239 


0.321 


3C.87 


Education Ivl 










1 vr High Sch 


0.778 


0.111 


0. 1 11 


20.11 


2 yrs HS 


0.503 


0.201 


0.296 


28.72 


CED 


0.542 


0. 184 


0.274 


27.09 


UHSDG 


C.461 


0.217 


0.322 


30.69 


ESDG 


0.366 


0.283 


0.351 


33.23 


College 


0.400 


0.400 


0.200 


32.80 


Ses 










Hale 


0.431 


0.246 


0.322 


31.17 


Female 


0.515 


0.175 


0.310 


28.39 


Race 

White 


0.467 


0.258 


0.275 


29.21 : 


Elack 


0.405 


0.204 


0.391 


33.18 1 


ether 


C.374 


0.238 


0.388 


34.02 


Mental Cat 










Cat 1 


0.357 


0.214 


0.429 


33.57 


Cat2 


0.401 


0.294 


0.305 


31.85 


Cat3a 


0.356 


0.320 


0.324 


32.40 


Cat3t 


0.457 


0.225 


0.318 


30.42 


Cat4a 


0.449 


0.230 


0.321 


2S.86 


Cat4b 


0.468 


0.222 


0.310 


30.80 


Cat4c 


0.443 


0. 193 


0.364 


32.57 


Mar. Stat/No, Deps 








Single/0 


0.441 


0.252 


0.307 


30.62 


Married/0 


C.448 


0.083 


0.469 


32.92 


Harried/1 


0.563 


0.000 


0.437 


30.19 


Married/2 


0.381 


0.095 


0.524 


36.76 


Married/3 


0.333 


0.083 


0.583 


42.00 


Military Skill 










11B 


0.461 


0.225 


0.314 


30.26 


13B 


0.464 


0. 195 


0.341 


30.73 


31M 


0.408 


0.241 


0.351 


31.07 


64C 


0.419 


0.302 


0.279 


31.16 


761 


0.402 


0.217 


0.381 


32.82 1 


S4E 


0.428 


C.260 


0.312 


30.85 


Age 










^ 17 


0.491 


0.215 


0.294 


29.65 


18 


0.432 


0.255 


0.313 


31.04 


19 


0.396 


0.266 


0.338 


32.09 


20 


0.482 


0.232 


0.286 


28.92 


21 


0.405 


0.250 


0.345 


31.61 


22 


0.495 


0.206 


0.2 99 


28.80 


23 


0.446 


0. 149 


0.405 


33.15 


24 


0.450 


0.225 


0.325 


32.18 


25 


0.470 


0. 118 


0.4 12 


32.77 


26 


0.556 


0.074 


0.370 


25.04 


27 


0.450 


0.150 


0.400 


30.80 


28 


0.364 


0.182 


0.454 


31.10 


29 


0.571 


0. 143 


0.286 


27.00 


30 


0.286 


0.000 


0.714 


53.14 


> 30 


0.524 


0.238 


0.238 


25.86 
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B€salts of 
Variatle 


TABLE XI 

Survivor Eurction Analysis for 
P (1CS<48) P(L0S=48) P (LOS>48) 


• 

4YO 

Ave. LCS 


All variables 


0.441 


0.239 


0.321 


30.87 


Edncation Ivl 










1 vr Hiqh Sen 


— 


— 


— 


2 vrs HS 


0.670 


0.110 


0.230 


V 31.78 , 


GED 


0.666 


0. 167 


0. 167 


29.67 


NHSEG 


0.500 


0.125 


0.375 


35.88 


BSDG 


0.367 


0.409 


0.223 


39.18 


Cclleqe 


0.600 


0.240 


0.160 


28.96 


Sen 










Male 


0.391 


0.389 


0.220 


38.29 


Eeniale 


0.667 


0.000 


0.333 


37.00 


Bace 










Bhite 


0.387 


0.389 


0.220 


37.74 


Black 


0.424 


0.288 


0.288 


39.36 


ether 


0.307 


0.500 


0. 193 


39.42 


Mertal Cat 










Cat 1 


0.333 


0.429 


0.238 


41.14 


Cat2 


0.376 


0.36 8 


0.256 


38.68 1 


Cat3a 


0.324 


0.485 


0.191 


39.78 


Cat3b 


0.427 


0.427 


0.146 


37.29 


Catha 


0.394 


0.346 


0.260 


38.10 


Cat4b 


0.465 


0.296 


0.239 


36.66 


Cat4c 


— 


— 


— — 


— 


Mar. Stat/No. Deps 








Single/0 


0.393 


0.402 


0.205 


37.86 


Married/0 


0.550 


0.150 


0.300 


38.85 


Married/1 


— 


— 


— 


— 


Married/2 


— — 


— 


— — 


— — 


Military Skill 








11B 


0.359 


0.412 


0.229 


39.43 


13B 


0.434 


0.352 


0.214 


36.73 


31M 


— — 


“ 


— — 


— 


64C 


— 


— 


— 


— 


76Y 


— 


—— 


— 


— 


S4E 


— 


— 


— 


— 


Age 










17 


0.418 


0.436 


0.145 


36.86 


18 


0.349 


0.429 


0.222 


39.56 


19 


0.353 


0.431 


0.216 


39.12 


20 


0.797 


0.018 


0. 185 


28.92 


21 


0.455 


0.394 


0.151 


36.73 


22 


0.476 


0.238 


0.286 


38.19 ; 


23 


0.662 


0.040 


0.298 


33.15 


24 


0.675 


0.050 


0.275 


32.18 


25 


0.647 


0.000 


0.353 


32.77 


26 


0.500 


0.125 


0.375 


33.50 


28 


— — 


— 


— — 


“ 


29 


— — 








30 


— 






— 


> 30 
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TABLE XII 

Attribute Values Providing Lowest BisJc of 
Attrition, 310 



Variatle Lowest £ (attrition) 


Next lowest P( attrition) 


Education 

Level 


HSLG 


Colleg e 


Sez 


Hale 


Female 


Race 


Other 


Black 


dental Cat. 


3A 


1 


Harital Stat/ 
No. of Deps 


M/2 


S/0 


Age 


28 


19 


MCS 


761 


31M 



TABLE XIII 

Attribute Values Providing Lowest Bisk of 
Attrition, 4x0 

Variatle Lowest E (attrition) Next lowest P( attrition) 


Education 

Level 


HSEG 


NHSDG 


Sez 


Hale 


Female 


Bace 


Otter 


White 


Mental Cat. 


3A 


1 


Marital Stat/ 
No. of Deps 


S/0 


M/0 


Age 


18 


19 


MCS 


1 1E 


31M 
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Attribute 

Variable 


TABLE XIV 

Values Providing Highest Reenlistment^l YO 
Lowest P (reenlist) Next lowest P (reenlist) 


Education 

Level 


HSEG 


NHSDG 


Ser 


Male ' 


Female 


Bace 


Black 


Other 


Mental Cat. 


1 


4C 


Marital Stat/ 

Nc. of Deps H/2 


M/2 


Age 


30 


28 


MCS 


761 


31M 



Attribute 

Variable 


TABLE XV 

Values Providing Highest Beenlistment « 4YO 
Lowest P (reenlist) Next lowest P (reenlist) 


Education 

Level 


NHSLG 


2 yrs HS 


Sex 


Female 


Male 


Race 


Black 


White 


Mental Cat. 


4A 


2 


Marital Stat/ 

No. of Deps M/0 


S/0 


Age 


26 


25 


MCS 


1 1E 


13B 



V. B ISDLTS AND CONCLUSIONS 



i. GENE6AI 

An intuitively pleasing , simple methodology was presented 
for the study of performance in the form of length of 
service of O.S Army enlistees. Some Exploratory Data 
Analysis technigues were demonstrated through the use of the 
IBM GEAFSTAI data analysis package. The interactive capa- 
tilities of the package and the APL language were exploited 
to provide a means of rapidly manipulating and observing the 
selected data. The tools proposed, the draftsman's 
displays, toxplots and survivor functions, were used on 
actual cohort data from the Defense Manpower Data Center. 

Several possible explanatory variables and their associ- 
ation with performance were presented based on the 
Exploratory Data Analysis. Confirmatory analysis was 

performed to support the Exploratory Data Analysis. 
Probabilities of enlistee attrition and reenlistment were 
provided using a survivor function analysis for each of the 
candidate explanatory variables. Attributes that presented 
the highest risk of attrition and the highest probability 
were presented- 

£. SOMMABI 

The increasing cost of "assessing, dressing and training" 
today's Army enlistee coupled with the diminishing supply of 
17-21 year old prospective enlistees have prompted research 
effort toward gaining insight into those personal attributes 
that produce the most successful soldier in terms of first 
term completion. The basis for understanding the relation- 
ships of these personal attributes and for using this 
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understanding in recruiting fclicy lies in the ability to 
rapidly analyze the available data on current enlistees and 
to present the analysis in a form that is understandable and 
useful for the decision maker. 

This thesis has presented a broadly applicable and 
simple methcdology, using Exploratory Data Analysis through 
the interactive capability of the IBM GRAFSTAT package and 
the All language, for defining the area of analysis, identi- 
fying errors in the data, reducing the dimensionality of the 
problem, and determining relevant association of personal 
characteristics of enlistees to performance. The bonds 
between Exploratory Eata Analysis and more classical statis- 
tical analysis were demonstrated. Use of survivor function 
analysis provided statistics on chosen explanatory vari- 
ables, indicating the importance of these characteristics. 

The further application of the methods in this thesis 
and of Exploratory Data Analysis in general should increase 
the practitioner's ability to make sound decisions regarding 
future manpower planning issues. With the increased avail- 
ability cf graphics-capable personal computers, Explcratcry 
Eata Analysis is relevant at all levels of decision making. 

C. BECOEHEIiDED FOBTEEB BESEABCB 

The following items deem further research: 

1. A comparison cf Exploratory Analysis technigues in 
this theses to other data analysis packages such as 
those available in BUEF and SAS (see [Ref. 17] and 
[Bef. 21] ) wculd be useful in determining advantages 
and disadvantages of the different approaches in 
variable selection and error identification. In 
particular, the Cox proportional hazards model in the 
BHDP program p21 [Ref- 17; pp. 576-594] uses a step- 
wise approach to identify important explanatory 
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variables and estimates the survivor as well as the 
hazard function for further analysis. Note however 
that models such as this Cox proportional hazards 
model for estimating the effect of concomitants on 
survival curves are not applicable because of jumps 
in the survival functions at known times. More 
research is needed to determine how to apply the 
model to such a function with this large discrete 
jump. 

2. The Graduate Equivalency Degree programs offered 
throughout the Onited States need to be analyzed in 
detail for standards used in awarding the certifi- 
cates. the wide variance and the poorer performance 
of the GED holders indicate that non-high-schocl- 
diplcma graduates should be treated separately in any 
analysis, contrary to the popular grouping of the two 
categories. Perhaps different GED levels would 
provide insight into future performance at least as 
well as the different levels of high school status 
have. 

3. Ihe trends and probabilities have been presented as a 
methodology on only a 10 percent sample of the data: 
comparisons of these outcomes to other data sets 
would be useful in the determination of prediction 
possibilities. 

4. Modeling of the survivor curves would provide a 
detailed account of the actual contribution of each 
explanatory variables using multivariate regression 
techniques. Again further research is needed in the 
applications of modeling techniques to survivor 
curves with the noticeable jumps at known times. 



APPJNDIX a 

EXPIOBATOEY DATA ANALYSIS TECHNIQUES 



Explcratory Data Ajialysis technigues are usually first 
attrihuted to John N. Tukey in his book by that title 
[Bef. 22]. Exploratory Data Analysis for the purposes of 
this thesis will be defined as "the activity of examining 
data, both graphically and through numerical summaries, for 
the purpose of revealing properties of the data itself, and 
with luck, of the processes giving rise to that data." 
[Bef. 23: p. 2]. Ihus EDA technigues can be thought of as 

"informal " technigues to examine the data prior to 
"formal", more classical analysis technigues, in order to 
prevent needless calculations irrelevant to the investiga- 
tion at hand. Quite often more can be learned about the 
data in this initial, informal look at the data. As 

Chambers et. al. points cut, graphical EDA methods are 
perhaps most effective in the initial glance at the data to 
limit the scope of the investigation to only those variables 
that are pertinent [Bef. 12]. These graphical methods allow 
the investigator to rapidly synthesize information, in a 
more efficient and intuitive manner perhaps than through 
methods available in commercial statistical packages that 
produce tabular data. 

Cne particular method of multivariate analysis is the 
multidimensional array of scatter plots called a "general- 
ized draftsman’s display"of the data [Bef. 12: pp. 136,145]. 
An example display is seen in Figure A.1 This figure demon- 
strates how the pairwise scatter plots are arranged so that 
"any adjacent pair of plots have an axis in common" 
[Bef. 12: p. 145]. All variables of interest, then, for 

the entire data set can be displayed as the first phase of 
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Figure A.1 



Draitsian Bis 



flay from Chambers et. 



al. 
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the investigation. Then one can rapidly and effectively 
deteriine if trends exist and for which specific pairwise 
association of variatles. 

Captain Malcolm Johnson, a student of Operaticns 

Besearch at the Naval Postgraduate School, has developed an 
fiPL program called "draftsman" that is imbedded in IBM’s 
GRAFSTAT package on the school's computer system (See 
[Bef. 24: pp. 13-17]. for information on use of GEAFSTAT) 
that organizes any data set into a draftsman's display. 
This program alsc allows for transformations of the data and 
for jittering of the data. His efforts have been published 
as a Master's thesis that includes a tutorial for use cf the 
"draftsman" program. [Bef. 13]. This program will be 
utilized in the initial phase of the data analysis efforts 
cf this thesis. 

Cf course, the draftsman's display is only the first 
step cf the analysis. If trends are evident, then further 
analysis shculd be performed utilizing more formal confirma- 
tory analysis to verify any graphically-determined associa- 
tions among the variables. 

The use of boxpicts is another EDA technique that is 
very useful in "taking an initial look" at the data. The 
boxplct is a "simple method of summarization". 

The upper and lower guartiles are depicted by the "tody" 
cf the tcx, the median is portrayed by a line, circle or 
other distinguishing mark as is the mean. Opper and lover 
adjacents are depicted at the end of lines extending from 
the body of the box. These terms are defined as the 
"largest observation that is less than or equal to the upper 
guartile plus 1.5 times the interquartile range, and 
smallest observation that is greater than or equal to the 
lower quartile minus 1.5 times the interquartile range," 
respectively. Values that fall outside the range cf adja- 
cent values are called outside values. These are plotted as 
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individual points. See Figure A. 2 for a depiction of the 
components of the boxplot. 

Ihe toxplot provides a rapid "impression” of the distri- 
bution of the data. The median, mean, and spread are all 
obvious. The length of the lines to the adjacent values 
demonstrate the "stretch" of the tails of the distribution. 
The individual points for the outside values allows the user 
of the plot to consider "outliers" although not every 
outside value is an outlier. 

The figure also allows for some determination of the 
symmetry of the distribution of the data, simply by viewing 
the symmetry of the body of the box about the median line or 
dot. 

These plots are useful when it is not feasible or neces- 
sary to capture all the details of a distribution, or when 
many distributions need be compared. The width of the box 
has nc significance. 

An excellent discussion of these and other EDA tech- 
niques is fcund in £Bef. 12], from which this description of 
boxplots was taken. 
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sample: box plot 

PLOT OF 50 POINTS 



• _ IFFtR OUARTILE * l.Sx INTERQUARTILE DISTANCE 
o 

V _ LARGEST VALUE i UPPER OUARTILE * INTERQUARTILE DISTANCE 




_ UPPER OUARTILE 
_ ^EDIAN 

_ LOWER OUARTILE ' ' 



_ smallest VALUE i LOWER OUARTILE - INTERQUARTILE DISTANCE 



° _ LOWER OUARTILE - 1.5* INTERQUARTILE DISTANCE 

t 



, INTERQUARTILE DISTANCE = UPPER OUARTILE - LOWER OUARTILE 



r’ 

I 

i 



: Example box plot for fifty data points from a regeneracive 
• simulation. The interquartile distance equals the estimated 
: upper quartlle minus the estimated lover quartile. The light 
circles are data points which fall between the largest value 
less than or equal to the upper quartile plus the inter- 
quartile distance and the upper quartile plus 1.5 times the 
interquartile distance. The dark circles are data with values 
above this latter point. Similarly for the lower part of 
the box plot. 



Pigure A. 2 



Ezaople Boxplot 
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APBEMDIX B 



HISTOGBAB OF FI79 NflSG ASSESSIONS 



T&BL£ \C* 
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1 
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1 
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2 
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1 
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* 
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1 
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2 


3.0 
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0.3 


7 
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4 
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0.0 


11 
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4 
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19 
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0.9 
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2.9 
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1 


0.0 
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2.5 


C5K 


1 


0.0 
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2.B 
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0.0 
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0.0 
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0.0 


360 


2.5 


IIT 


1 


0.3 


261 


2.9 


m 


2253 . 


17. ^ 


2619 


2C.6 


IIY 


3 


3.3 


2624 


23.7 


128 


243 


2.7 


2«72 


22.4 


12 : 


63 


0.5 


2332 


22.9 
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24.2 
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536 


4.2 


362 7 


2«.5 


1 JC 


3 
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74 
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15E 


93 
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22.1 


ISG 


1 


0.3 


4074 


22.1 
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0.1 
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32.1 
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26 


0.2 
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22.2 


16C 
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4127 
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43. 3 
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1 
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T40Lr NC. 1 

F»CS CISTRId’lTICN FOR ARf»Y ACCISSICNS, 
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f'os : 



TA 6 L 1 NO. 1 

:3T=?:out::n f-^r ar>*y nfs acc^ssicns. 



CC3^ 
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17 

1 
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ta8l: nc. 1 

?»C3 CIST=^I9UTI0M F CR AR>»Y NPS ACCESSIONS. 
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APPENDIX C 

FOBTHAB PKOGBAHS TO BEAD DATA 



c 

c 

//□THOMAS JCc ( , • D.TnOMAS* ,CL ASS=d 

//^«MAIN uKG-NPGVMl.iv72f 
// EXEC FCPlXCG 
//FOkT ,SYS IN JU w 

INTcGcK<^A SSAM,SSAN2 ,SSaNj, 

UuSYC fOCSMb f uuS CC tuC jY S i CLoHSi jCSlS 

iNTtbcR^H oLA'NKtNiNc 

Data 8LANK/» »/.NiNE/' -‘^»/ 

DC 300 1=1, J077ti, 10 



SSANl 

SSAN2 

SSAN3 

DCSYC 

DOSMO 

DCSOC 

DOSYS 

DOSMS 

DCSDS 



DLAN^ 

bLAN^ 

NIi4c 

-9 

-9 

-9 

-9 

-9 

-9- 



300 

100 

200 



READ ( 1, 100 i SSANL,SSAN2 , SSAN3 ,u OSYsJ ,DJSMb,J0S^o , 

> DOSYS, DOSMS, DOSCS 

wR IT Ei 2, 200) SSANl, SSa;>12 , jSANS , UCSr C ,^USMu , JuSi>G , 

> DOSYS, DLSrt:>,DOSCS 
CONT iNLE 

STOP 

FORMAT (3A3,1o9X, I 2 , 12 , i2 , a ?A , 12 , 1 2 , 12 ) 

F uRM AT (3 a3, 1a, I2, 1a, 1 2, IX, 12, 1a, i 2,1 a ,12, 1a, 12) 

END 



/^ 

//GC.FTOlFOOl CC IT = 3 23DV , vOl= SE K=MSO 2 o2 , D a:»P = I UL D ) , 

// DCo=(RECrM = F6,LRECL=32o,DLKSAZ£ = 127L*») , 

// DSNAM E = RiS. SI 972. CFT79 

//GO.FT02F00L JD J.^IT = 3 2:>0, VUL= SER=MVSDOa ,j 1 SP= ( Smk ) , 

// SPACc = <CYL ,IA,a) ) , DCo= (R£CFM=Fb,LR £CL= 27 , oLKSi 2c = l 90b2 ) , 
// OSNAME=S1972.o79D 
// 
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C This FROuHAM STRIPS TFjSt v'AkS TmaT CHA.wE 
C HENT, 

//OThOrtAS jCe { 1972faA^e)t» o.TnCFAS* fLL ASS^b 
//'-^^^AiN CRG = NPbVMi. 1972 F 
// EXEC FCFlXCb 
//FCKT *SYS IN LD ^ 



w>VcK T HE T cRh lF ENLIST 



1NTEG£R»4 SSAM,SSM2,SSAjO, 

C: TAFHC »HYcCUfPGu »rtiU ,CcPL fcTSYOt £ T Sfio ,s.HS v j » 

REQf iMFHSthYECS fPo:> »r<iS tLcP:> fET iYif cTS;*5StLrtSVSf 
^ RESf TAFf'LfHYcCL *PGl f^'SL »CcPL f 
^ ETSYL fcTSrtL fCHSVLfKEL 
1 NTElER*A oLANRtixlRc 
LATA ELANK/* */,NlNc/» 

DO aOO i=l»iJ77b* 10 



C 



C 



C 



3J0 

100 

200 



SSANl 


s 


BLANK 


S SAN 2 


= 


BLmNK 


SSAN3 


= 


NINE 


TAFMC 


= 


-9 


HYECu 




-9 


PGC 


s 


-9 


M SO 


s 


-9 


DEPQ 


= 


-9 


ETSYC 


= 


-9 


ETSML 




-9 


CFSVO 


= 


-9 


REG 


■S 


-9 


TAFMS 


s 


-9 


HYECS 


s 


-9 


PGS 


=■ 


-9 


MSS 


= 


-9 


DEPS 


■S 


-9 


ETSYS 


=. 


-9 


ETSMS 




-9 


ChSVS 




-9 


RES 


= 


-9 


T AFML 


s 


-9 


HYECL 


= 


-9 


PGL 


= 


-9 


HSL 


s 


-9 


DEPL 


s 


-9 


ETSYL 


= 


-9 


ETSML 


s 


-9 


ChSVL 


s 


-9 


REL 


= 


-9 



READ (1,100) SSAM ,SSANZ ,£SaN 3,T AFMu ,hY ECO, P Go ,M SL , 

^ OE Pu f c t S Yu , ET Sh C , Cn S vC , A Eu , 

♦ TAFMSfHYECS ,PGi ,MSS ,UwP S , ETuYi, E TSh S ,Cn SvS , rsE S , 

^ TAFMLfHYECLfPGL ,RSl ,lEPl tcTSYL, ETSrtLfcni»Vu,iNcu 

mR it E (2 ,200 ) SSAK1,SSAi 42 * £SAN3,T AF Ru ,hY ECu,PGG,M^ut 
^ 0£ PCj ,£ TSyu, t TSrt C fCrtS Vu , K cC , 

• TAFMS,nYEC^,PGS ,rtSS , ucP S ,c T S yS , E T S ,C hS */S , RE S , 

^ TAFMLfMYECLfPuL ,MSL ,bcPL,£;TSYLf ET;^<iL,LhSVLtKEL 

CCNT INLE 
STOP 

FORM AT (3A3,1^2X, 1 3,oa , 12 , 12, lX.ll ,12, 17 a, 12, 12, AX.ii, II, 
^IIX, 13 ,ca, 12, 12, lx, 11 ,12, liX,l2,I ^,AA,11 ,il,llAfiji, 
♦6A,12, 12, lA,I 1,12 ,l7x ,12 , 12, ^X, II ,11) 

FORMAT (3A3,1X, 

^13, lx, 12, lx, iZ, lx, 12, lx, 12, IX, 12, 1X,1 2, lx, 
*12,lX,12,lX,i3,lX,i2, IX, 12, lA, 12, lA, 
^I2,iX,12,lX,I2,lA,l2, lA,iZ,lA,I3, Ia,12,1a,12,1a, 

^12, lx, 12, lx, 12, lx ,12, 1 a, 1 4,1a, 

>2151 



/• 

//GC.FTOIFCOI DC J.UT = 3330V , Voi.-S£R = ^':.0z32, C ISP= ( ucu) , 
// DCb*(RECFM = Fb,LKECL=320,bLKSUE-=1271*,) , 

// DSKAME = ^SS.Sl9 72 .CFT79 

//G0.i^TG2F COl jD UiU T =j> 330, vCe=S £R=HV30 oA ,0 I SP= ( SHR ) , 



// SP ACE - (CYL I , DCB= ( REcFR = F6 ,lR £:Cl= 9 J , oLNi. 1 ZE- 1 9 J:jP ) , 

// DSNAM£=S1972.u79A 

// 
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C This FKu oS STrvlnS ? EtsSu Ai Ai. V AK F r>uA fci»T if<E 7^ v^uHufs T • 
//OThurtAS JUa ( lSi72f03ScJ ,* J-IrtUFAS* ,wi_ASS=o 
//^MAll^i Ui<b-NPoVMl . 19 72 F 
// EXEC FGRTXCG 
//FORT .SYS IN OD » 

IN T EGERV4 SSA M f 5 Sai) 42 » SS Ail3, A G5E» riYco*Scx t/\Acc*cIh 
® RcThfhS » AFwTT Gt EuTSTfHt »rtMlvAi.f fc AAn sT » 

® TU c t E Pu t b3(«tJS » T r<MbS 3 
INTEGEft'**^H 6 lA*<r»NIN£ 

JATA 6LAHK/* V..^lNc/* -9*/ 

UC 3G0 I-l»3077ti, 10 



S SAN 1 

SSAN2 

SSAN3 

AGEE 

hYEC 

SEX 

RACE 

ETh 

RETH 

MS 

AFOTTG 

ENTST 

Hi 

ml 

k^AlV 

mAIVAL 

EXAMS! 

TOE 

EPG 

BONUS 

TRM0S3 



BLANK 

duANK 

nine 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 



REAC( If 100 JSSANl, SSAN2 t S SAi'^3 fAb EE tH YEL t S Ex , RACE t ETn , 
<* REThfMSfAFOTTGf ENTS I »riT »»« fwAlv »i%AlVALfEXAMST t 
^ TOE » EPG f bUNU;> f T RMuS 3 

»»RITE(2 *200 )SSAM,SoA;^2 * SSan3 , A oEc , hYEC »SEa ,KAcE,ETh 
^ RETh fM^fAFuTTG* EixTST *HT *inT *«(Aiv f»«Al VALicaAMST » 

* TOE# EPGf BONUS * T RMUS3 
300 CONTINUE 
STOP 

100 FORMAT 13A3*23X. I2flX, 12* I If II, 12, il,12,4Afll,A0A, 

»*I l*I2*13*i2A*12*. 

^12*I2tEX*12,12*13X*ll *13X*l*i) 

200 FORMAT 13A3*1a, 

^12*lXtl2tlX*I2,lX*12, lx*12*lx*12* 1X*12,1X, 

^12, lx* 12* lx, l 2 , lx, 13, lx* I 2 fix* 12* lx,12*lA* 
XI2*1X*I2*1X,12*1A,1AJ 
END 



/« 

//GC.FTOlFOOl 00 UN 1 T = i 230V * VO l = SEk = RSJ 2 , 0 IS R- (UL 0 ) * 

// UCE=( RECFM=FD,LR£LL=32o,dLKEUE = 127iA* , 

// uSNAME = FSS .S19 72 .CI-T79 

//GO.FT C2F 00 1 JD Ui', lT = i 33 J * VuL = 5 ER-M^33 J A ,b 1 SP= ( bE^ , i\cE P ) , 

// S P ACE= (CYl ,1A**,) ) *OCb= (KECFH=FB**_(<ECL=oo*oLA.SiZc = l 9UJe I 
// USNAM£=S1'^72.C79B 

// 
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c 

//OTflUHAS JCc ( I972f b.ThO^AS• *CLmSS*6 

0RosNPbVMl.i972F 
n EXEC FCRIXCG 
//POHT .SYb IN Oi) » 

INTEGER*^ SSAM,SSAN2 »SSAN2, 

% CENk,CENO»ZiPi , ZIP2 ,haK,f.ECli,,TF ,ARCTP , APTm, 
w APTB,APTC»aPTJ, APlc , APT F , APtC» A PT n, APT i f mPT Jt APTi^t aPTL , 
AP T« ,APTN,APTu, APTP f AFEE^ 

1NTE6£R<‘4 dLAi4K,NlN£ 

DATA BLANK/* '/tNlNE/* -9* / ,N IN/ • -9* / 

DC 3Q0 1=1,30776, 10 



C 



C 



C 



300 

100 

200 



SSANl 


= 


BLANK 


SSAN2 


s 


OLANK 


S SAN 3 


- 


N1 Nc 


CENK 


— 


-9 


C End 


s. 


-9 


ZlPl 


— 


BLANK 


Z 1P2 




NIN 


HGR 


= 


-9 


RECID 


= 


-9 


TF 


s 


-9 


AFCTP 


s 


-9 


APIA 


s 


-9 


APTB 


s 


-9 


APTC 


s. 


-9 


APTO 


= 


-9 


APTE 


s 


-9 


APTF 


s 


-9 


APTG 


= 


-9 


APTH 


= 


-9 


APTI 


s 


-9 


APTJ 


s 


-9 


APTK 


- 


-9 


APTL 


s 


-9 


APTM 


s 


-9 


APTN 


= 


-9 


AFTO 


JS 


-9 


APTP 


= 


-9 


AF££S 


= 


-9 






R£ACU,iOO) SSAM,SSAr42,SSAiMJ, 
CENk ,CcND,ZlPl ,21P2 ,hOK, 
R£ClC,TF,AFgTP, APTA ,APTB , 
APTC,APTJ,APTE, APTF , 

APTG ,APTh,APT i , APTJ , APTK , 

APTL ,APTK,APTN, APTu,APTF 



«RITE(2,200) SSANl, SSAN2,SSAN3, 

* CENR,CEN0,Z1P1 ,ZiP2 ,hUR, 
k£CiC,TF,AFgIP,APTM ,APTB , 

^ APTC ,APTO,APTE, APTF , 

* APTG, APTH, APT 1 , APTJ , APTK , 

* APTL ,APTR,APTN, APTu ,APTP ,aF£ES 
CCNT INLE 

STOP 

F CRM AT (3A3, 12 ,11 , A3, A2,12,12X, II, 9 A, I 2,12, lA, 12,12,12, 12, 12,12 

« 1 2,12, 12, 12,12, 12 ,6 A, 12, 1 2 ,12,12, tOA, 121 

format 13A3, IX, / 

* 12,iX,12, U,2A3, IX, 12, IX , 12 ,lx,12 , IX, 12, IX, 

^12, IX, 12,1a, 12, lx ,12, 1 a, 12, lx, 12, lA, 

^1^, lx, 12, lx, 12, lx, 12, lx, 12, IX, 12, lx, 12, 1 a, 12, IX, 

C^12,1X, 12, IX, 12) 

END 



/* 

//GC-FTOIFCCI DC uU IT = 3 330V , wU L = E£R = RoJ 262, 0 i^P= (GL D) , 

// DC fa= ( k tCFM = Fo , LR tC L= d2 o ,bLKi 1 Z£ = 12 7 in J , 

// J6NAMd=PSS.Gl972.CFT79 

//GO .FT G2F CC 1 OD Of^lT = j 3^0, vlL=SER = M^SO 0*+ ,j 1 SP= ( ShR ) , 

// SP ACE = (CYL ,( 4,9) ) ,CC6= (R£CFP=FB,Lk ECL= d 6 , 6 LK6 1 ZE = 1 90n0 ) , 



// DSNAME=S1972.C79C 
// 



97 










jmi 










MIINDIX D 

API PROGEAHS FOB DATA UANIPOIATION 



VSTP-1P2CQ3V 
T7 S T R 1 P 2 

Ci: TAFMO^r.PAFTic;i 2 

122 H » ECO^-DRAFTlC f 2 J 
C3: p60^draftic;33 

142 MS 0 <-DP.AFT 1 C } 4 ;] 
cs: r.eFO^DPAFTic;53 

L6Z CHSV04-DRAFT1C 

Z71 REOfdraftic J73 
C83 ftGES<-I-RAFTlC f 83 

C93 HTEC«-I*RAFT 1 C ; 93 

C103 sEX 4 -r.RAFTi[;jio: 
C113 RACE^-DRAFTlCJlin 

CI 2 : eth«-e.raftic;i23 

C 133 MS<-r.RAFTic;i 33 
C 143 AFOTTG 4 -DPAFT 1 C f 143 
C 133 MOS<-r.RAFTic ; 153 

C163 A<-DRAFT2C ;23 
C173 E<^E'RAPT2Cf33 
:i83 c^r.RAFT 3 c J43 
C193 C‘4-I‘Rapt3c 



vRECOi>ecD3^ 

7 RECODE r« AT A f C ; D ; »<COr>E f TSTCF ; RI ATA 



ci: 


' INSERT 


THE 


COLUMN 


TO 


BE 


RECODED 


C2: 

ZZl 


c«.D 

Q4- ' INSERT 


THE 


DIGITS 


TO 


BE 


RECODED 


C43 

C52 


D<-C 

Of. » INSERT 


THE 


NUMBER 


TO 


BE 


R E C 0 !• E D 



C63 MCor«e<-a 

C73 TSTOR<-DATA 

C33 DATACJCTvDATAC JC3=D 

C93 DATAC ;c34-datac ;c3x/<cor«E 

C103 RDATA<-DATA 

C113 RDATAC?C3«-RDATAC;C3 + TSTORCrC3x(RD AT AC ;C3 ENCODE) 
l 123 redata^-rdat a 

C133 □«- ’ THE RECODED DATA IS NOW A GLOBAL VARIABLE CALLED REDATA 

9 
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;?AVE5CANC 03'!7 

^ R4-AVESCAM A ft P: A T ; C 1 J C 2 f f V A L J V 
C13 ftTHIS FUNCTION CHOOSES A SUSSEX OF AN AftPAr ANI« 

C23 ^CALCULATES THE MEAN OF THAT SUSSEX 

C33 Ql' INSERT THE COLUMN HUMSEP OF THE SELECTION VECTOR' 

C43 ci«-D 

C53 V<-ARRAYC ;ci 3 

C63 □<-' INSERT THE COLUMN NUMPER OF OF THE VAft YOU WANT AVEFAGET' 

C73 C24-Q 

C83 A4-ARRAY1: ; C23 

C93 Q«-' INSERT THE PESIREt' SELECTION VALUE FOR THE • SEL VECTOR' 

C103 VAL4-D 

C113 Sik<’ ( VsVAL ) /A 

C123 Q<-‘THE SUSSEX THAT lOU HAVE SELECTED IS STORED AS GLOBAL VARIADLE 
SEL ' 

C133 D^-'THE average of your selected personnel IS' 

C143 i§L) 



vCOLATCQ3y 

^7 COLAT 

Cl 3 I'RA^T14-A79[: j23 »*79l f 33 » A79C ; 43 »<^79C ;S3 »-79: J63 f=^79C*f 93 » A79C f 103 

C2 3 DRAFTi^ORAFTi ,D79 c J 2 3 r & 7 9 C f 3 3 » p 79 C » 4 3 » ^ 7 9 C J 5 3 r ^<7 9 C f 6 3 » ^ 7 9 C f S 3 

C3 3 draftk-df AFTi ,S79C ;93» P79C ; 193 

C4 3 PRAFT2«-A79C?23fC79[;»93fC79C?103»C79Cf ll3rC79C; 123 » C79C ; 1 3 3 » C 79 C ; 14 

3 

C53 DRAFT2<-DF:AFT2,C79C;iS3»C79C;i63»C79Cf 173fC79C;i83»C79Cf 19 3»C79C?20 

3 

C6 3 r»RAFT2<-C'RAFT2»c79C >213 »C79C J223 »C79C 5 233 f C79: ;24 3 

C73 DftAFTl<-6)( 15 3078 fC-PA»^Tl) 

C83 draft 2«-^( 17 3078 fr«RAFT2) 





7MATSLDCQ37 
y MATSLD 

C13 flSUILDS MATRIX FQF- USE IN ANOVA TESTING IN CHARTEP 3 

C23 POW 4 . “99999 ‘99999 ‘99999 ‘99999 ‘99999 

C33 matf.4. 1543 5 ^row 

C43 MATRC ; 134"GEE‘» < 196'V><<^trc > 13) 

C53 MATRC f 23 ^msdg 

C ^3 MATftc ; 33^HHSG, (503 4rWATR[; >33 ) 

C7 3 matf c > 43<--^p: » (75li««Tftc>43) 

C83 watrc>53<-50»^h, (io^^natrc f53) 

rf _ 
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ci: 

C2D 

L21 

ZA 2 

C5D 

C 61 

Z 71 

C8D 

C9D 

lIo: 

CUD 

C12D 



CID 

C2D 

C3D 

LAI 

L 51 

L 61 

L 71 

C8D 

C9D 

CIOj 

CUD 

C12D 

C13D 

C14D 



CID 

C2D 

C3D 

C4D 

C5D 

C6D 

C7D 

CSD 

C9D 

CIOD 

Cl 12 

CUD 

CUD 

C14D 



vPFfCCDD^ 

^ F-4.F P c DATA ; DftTAi J sel; NEQ36 f HLT36 ; HGT 36 ?E036»>-T36J«T36;rco36;FuT3 
6 J'^gt36 

flCALCULATES R E L , F P. E O ( P PO & A D I L I T Y ) FOR SURVIVAL ANALYSIS, 

□ <-'CHECK TO SEE IF THIS FM IS SET UR FOR 3 ^ OF 40 fO' 

AVE5CAN DATA 
DATAl^SEL 

?IE0 48<-f EQ4 0^ ( DATA1 s 48 ) /DATAl 
NLT48(-f LT4S<- (DATAl < 48 ) « 1 

NGT48<-/^GT36<-(DATAl,48)/t«ATAl 
P2«-PE048<-NEQ48-f SEL 
Rl<.FLT48<-NLT40-i.fSEL 
R34.PGT4a4-HGT40-f SEL 

Q<- ■ VECTOR OF PR O D A B I L I T I E S ; R ( L 0 S < 4 0 ) » P ( >- 0 S = 4 3 > , P < L 0 S > 4 8 ) ' 

P<-P1 ,P2»P3 



VY04MKRCnDV 
c T O 4 M K F r. A T A 

fl&UILDS MATRi:< OF 4 YO ONLY DATA. DATA IS ORIGINAL 
ft DAT A SET, 

Y04^C.ATAC U1D=4 

NDATA 4 t. 479 10 f ( 1 1 1 1 1 1 1 1 1 1 ) 

NDATA4C;i34-Y04/C'ATACflD 
ndata4C;2D<-»04/c^tac;23 
NDATA4C ;3 3<.Y04/t«ATAC J 3D 
NDATA4C J 43<-Y04/C'ftTAC »4D 
HDATA4C »5D<-'' 04/E'ATAC JSD 
NDATA4C » 6 Dp'» 04/datac ; 6D 
ND AT A4C ;7D^'^04/catac; 7D 
NDATA4C »8D*-YO4/riATACf03 
NDATA4[;93<-r04/DATAC}93 
.NDATA4C ; 10 34.Y04/I*ATAC ; lOD 

V . 

;?T 03 mkf CQD^ 

V T 0 3 M K R DATA 

ft&UILDS MATRIX OF 3 YO ONLY DATA, 'DATA* IS ORIGINAL 
ftDATA SET 

Y03^DATAC U1D=3 

NDATA4- 2565 10 f ( 1 1 1 1 1 1 1 1 1 1 ) 

NDATAC J n<-Y03/DATAC J 1 ] 

ndatac;23^i03/data[;;2D 

NDATAC;334-Y03/DATAr;33 

NDATAC f 4D<- »'0 3/DATAC ; 43 

hdatac;5D<''03/datac; 5D 
wt.ATAC J 6 D<-» 03/datac ; 6 D 

NDATAC ; 7Dp»03/datac j 73 

ndatac;234.t03/datac;83 

ndatac;9D<-»'03/datac;93 

NDATAC;iODt-Y03/DATAC;iOD 
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APPENDIX E 

OVEBAIL VIES OF FIBS! DBAFTSMAN*S DISPLAY 




TOTAL SEKVICC MS OffTAlHED PAY CWlOC MAWTAL STATUS HO. OCPS 
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MCS mCKTAl CATOO^t MM. STATUS AT rSiTPTY 
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APFEMDIX G 

INPDI SCBEEH, IBH GEAFSTAT SOBPOPOLATION CATEGOBY AHAIYSIS 

P60GBAE 



CATCQijry 

C*TtCO»T 
T vAitiAai; 

LWWCICwiEO CO) » ti^lOiTS 
SCLECTIOM 

pr«cfwTiLCS (Cff wt) 

snaoLS ocHNco 

PLOT *CAO£R (IN OUOT(S) 

SCKTCN ki.aC*€» (In QUOTES) 
*-A3tlS LA«l (IN QuOHS) 
V-AXIS L*8fl (IN QUOUS) 

Pt.O» POSITION « 

SCALE f-AJlS l!»i 

NAPTIAL «*10T 

Alts *K) C«13 CON’AOL 



AMkLTSlS 
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TABLE POSITlONr 
SCALE r-AirlS : LIN 

T 1 ! 
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CNTENH30 7^!CVI G»A«^lCS(327f) 3-«tTLAN4 A-A^tU « SC^ffN 

CUAlM3£fAU^ 5-LAST RtSR>«SCS «-EK«SC r-A»NOr|LE ^MCETlNtO 

^SPQNSES OUTPUT •OSTQNt/'NETPIEVt H-lNtO AFl l?-SC*tEN OIS^Ar 
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APPENDIX fl 

BOXPLOT ANAIYSIS CF BEMAINING VABIAELES 

Bcxplots cf the remaining candidate explacatcry 
variables versus length of service are provided in 
this appendix. Befer to Chapter 3, pages 62 
through 66 for discussion on each of these 
bcxplots. Bemaining candidate explanatory vari- 
ables displayed in this appendix are listed in 
Table XVI 



TABLE XVI 

Candidate Explanatory Variables 



Mental Category 
Marital Status 
Age 
Sex 

Hcl 06 

Beenlistnent Code 
Character of Service 
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Figure H.1 Mental Category vs. Length of Service 
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Figure H.2 Mental Category vs. Length of Service, II 
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Figure H.3 Mental Category vs. Length of Service, III 
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Figure H.4 Mental Category vs. Length of Service, IV 
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Figure H.5 Mental Category vs. Length of Service 
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Figure H.6 Hental Category vs. Length of Service, VI 
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Pigure H.7 Mental Category vs. Length c£ Service, VII 
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Figure H«8 neotal Category vs. Length c£ Service, VIII 
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Figure H.9 Mental Category vs. Length of Service, IX 
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Figure H. 10 Harital Status vs. Service 
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figure H. 11 Marital Status vs. Service, II 
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Figure H.12 Marital Status vs. Service, III 
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figure H. 13 Marital Status vs. Service, IV 
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figure H. 14 Marital Status vs. Service 
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figure H, 15 Marital Status vs. Service, VI 



ACE VS SERVICE ACE VS SERVICE.NPS ONLY 
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Figure H.16 Age vs. Service 
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Figure H. 17 Age vs. Service, II 
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Figure H. 10 Age vs. Service,!!! 
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Figure H. 19 Age vs. Service, IV 
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Figure B.21 Age vs. Service, VI 
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Figure H.22 Age vs. Service, VII 
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Figure H.23 Age vs. Service^VIII 
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Figure H.24 Age vs. Service, li 
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Figure H.25 Sex vs. Service 
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Figure H. 26 Sex vs. Service «II 
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Figure H. 27 Sex vs. Service «III 
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Figure U.28 Sex vs. Service, IV 
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Figure H.29 Sex vs. Service 
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Figure H.30 Sex vs. Service^VI 
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Figure H. 31 Bace vs. Service 
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Figure H. 32 Race vs. Service, 11 
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Figure H.33 Race vs. Service^lII 
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Figure H.34 Bace vs. Service, IV 
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Figure H. 35 Race vs. Service 
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Figure H. 36 Race vs. Service, VI 
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Figure B.37 fieenlistaent Code vs. Education Level, I 
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Figure H.38 Reenlistment Code vs. Education Level, II 
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figure B.39 Beenlistioent Code vs. Service 
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Figure H.40 Beeulistment Cole vs. Service, II 
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Figure H.41 Eeenlistment Code vs. Service, III 
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Figure H.42 fieenlistment Code vs. Service, IV 
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figure fi.43 Beenlistoent Code vs. Service 
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Figure H.44 fieenlistment Code vs. Service, VI 
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Figure G.45 Character of Service vs. Education Level, I 
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Figure fl. 46 Character of Service vs. Education Level, II 



CHARACTER OF SERVICE VS. EDUCATION LEVEL CHARACTER OF SERVICE VS. EDUCATION LEVEL 

t OUTLIER ED. LVL. DELETED . COLLEGE DEIETEO 



r- 

I 









— o o o o o 



o — ^ « 

^ • «n 



I 

1 

I 




5 s? 
< — 
? n 



i 

r~ 

w 

s 

-I 

Jr 

at5 

s 

o 

5 5 

3 Z 









O — O O O — O — O Ol 



3,sa?:;2rcs 



^ « 5 

^ <** — O « 



^ ^ ^ 

SA7SSlS8S 

<^000000000 

^o - — ^1© 






- I 



162 



Figure H.47 Character of Service vs. Education Level, III 
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Figure H. 48 Character of Service vs. Service 
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CHARACTER Of SERVICE VS. LENGTH OF SERVICE CHARACTER OF SERVICE VS. LENGTH OF SERVICE 
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Figure H.50 Character of Service vs. Service, III 
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Figure fl. 51 Character of Service vs. Service, IV 
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Figure H. 52 Character of Service vs. Service 
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Figure H. 53 Character of Service vs. Service^VI 
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Figure H.54 Character of Service vs. Service, VII 
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APPENDIX J 

INPUT SCREEN FOR IBM GRIFSTAT CDF PROGRAM 
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AP PENDIX K 

S0B7I70B CDB7ES FOB 3 lEAB ENLISTEES 

This Appendix contains survivor curves for FY79 3 
year-obligated enlistees for the six candidate 
explanatory variables listed in- Table X7II below. 
Tabular summaries of the analysis and discussion of 
the analysis is provided in Chapter 4 of this 
thesis. 



TABLE X7II 

Candidate Explanatory 7ariables 



Sex 

C0 

Mental Category 

Marital Status/Number of Dependents 
Age 

Mxlitary Cccupational Skill 
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Figure K.1 Sex 




Figure K.2 Race 
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Figure K. 4 Marital Status/ Naober of Dependents 
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Figure K.7 Military Occupational Skill 






APPENDIX L 

S0B7I70R C0E7ES FOB 4 TEAR ENLISTEES 

This Appendix contains survivor curves for FY79 4 
year-obligated enlistees for the six candidate 
explanatory variables listed in Table XVII telow. 
Tabular sumiaries cf the analysis and discussion of 
the analysis is provided in Chapter 4 of this 
thesis. 



TABLE X7III 

Candidate Explanatory 7ariables 



Sex 

Race 

Mental Category 

Marital Status/ Number of Dependents 
Age 

Mxlitary Cccupational Skill 
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Figure L. 1 Education Level 
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Figure L.2 Sex 
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Figure L.3 Race 






9t 

3 



X 

i 




186 



Figure L. 4 Mental Categor]^ 
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Figure L. 5 Marital Status/ Number oi Dependents 
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Figure L. 7 Age II 
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Figure L.8 Military Occupational Skill 
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